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ABSTRACT 

We present the Planck Catalogue of Galactic Cold Clumps (PGCC), an all-sky catalogue of Galactic cold clump candidates detected 
by Planck. This catalogue is the full version of the Early Cold Core (ECC) catalogue, which was made available in 2011 with the Early 
Release Compact Source Catalogue (ERCSC) and contained 915 high signal-to-noise sources. It is based on the Planck 48 months 
mission data that are currently being released to the astronomical community. The PGCC catalogue is an observational catalogue con¬ 
sisting exclusively of Galactic cold sources. The three highest Planck bands (857, 545, 353 GHz) have been combined with IRAS data 
at 3 THz to perform a multi-frequency detection of sources colder than their local environment. After rejection of possible extragalac- 
tic contaminants, the PGCC catalogue contains 13188 Galactic sources spread across the whole sky, i.e., from the Galactic plane to 
high latitudes, following the spatial distribution of the main molecular cloud complexes. The median temperature of PGCC sources 
lies between 13 and 14.5 K, depending on the quality of the flux density measurements, with a temperature ranging from 5.8 to 20 K 
after removing sources with the 1% largest temperature estimates. Using seven independent methods, reliable distance estimates have 
been obtained for 5574 sources, which allows us to derive their physical properties such as their mass, physical size, mean density and 
luminosity. The PGCC sources are located mainly in the solar neighbourhood, up to a distance of 10.5 kpc towards the Galactic centre, 
and range from low-mass cores to large molecular clouds. Because of this diversity and because the PGCC catalogue contains sources 
in very different environments, the catalogue is useful to investigate the evolution from molecular clouds to cores. Finally, the catalogue 
also includes 54 additional sources located in the Small and Large Magellanic Clouds. 

Key words. Submillimetre: ISM - ISM: clouds - ISM: structure - Galaxy: local interstellar matter - Stars: formation 


- 1. Introduction 

* Corresponding author: Ludovic Montier, 

Ludovic.Montier@irap.omp.eu The all-sky Planck 1 mission has opened up the possibil¬ 

ity of carrying out comprehensive investigations of the 


1 Planck (http://www.esa.int/Planck) is a project of thj 
European Space Agency (ESA) with instruments provided by two 
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Fig. 1 . All-sky distribution of the PGCC sources: 13188 Galactic clumps ( black dots), plus 54 Large Magellanic Cloud 
(LMC) and Small Magellanic Cloud (SMC) clumps ( grey dots). The source distribution is overlaid on the 857 GHz Planck 
map, shown in logarithmic scale between 10 2 to 10 2 MJysr -1 . 


Galactic emission components. With its high sensitivity 
and wide wavelength coverage, Planck is providing all¬ 
sky maps of the thermal dust emission and, in particular, 
of the emission arising from cold dust. Because cold dust 
is mainly associated with dense regions within molecular 
clouds, these observations are relevant for studies of the 
early phases of star formation, in particular to explore how 
star formation depends on the physical conditions provided 
by the parent cloud. To this end, it is necessary to investi¬ 
gate the spatial distribution and physical properties of dense 
clumps in different Galactic environments, and this objec¬ 
tive can be attained only by extended surveys, which can 
cover the full range of scales encompassed by the star for¬ 
mation process, i.e., from subparsec to several kpc. 

During the past decade, new insights on the study of 
cold sources have been provided by sophisticated numerical 
modeling and by the development of sensitive millimetre 
and submillimetre detectors, operating both from space and 
from the ground, and with both imaging or spectroscopic 
capabilities (e.g.. Sect. 1 in Planck Collaboration XXIII, 
2011). By combining the highest frequency channels of the 
Planck survey (353-857 GHz, 350-850/rm) with the far- 
infrared IRAS (Neugebauer et ah, 1984) data, and by ap¬ 
plying a dedicated source detection method, which lever¬ 
ages on the cold sources spectral signature, we can obtain 
an all-sky census of the Galactic coldest objects. In partic¬ 
ular, the method of Montier et al. (2010) makes it possi¬ 
ble to separate cold and warm dust emission components, 
and to derive the physical properties (flux density, size of 
the emitting region, temperature) of the cold component. 
Furthermore, Planck has provided the first uniform submil¬ 
limetre surveys that cover both the Galactic plane and re¬ 
gions at intermediate and high latitudes, which allows us 

scientific consortia funded by ESA member states and led by 
Principal Investigators from France and Italy, telescope reflectors 
provided through a collaboration between ESA and a scientific 
consortium led and funded by Denmark, and additional contribu¬ 
tions from NASA (USA). 


to expand the physical parameter space probed by the pre¬ 
viously known cold sources. The Planck detected sources 
span a broad range of temperature, mean density, mass, and 
size. The most compact and nearby sources have a linear di¬ 
ameter of ~ 0.1 pc. At large distances, though, and because 
of the limited instrument resolution, many sources have an 
intrinsic size of tens of parsec. More importantly, the av¬ 
erage Planck cold clump, with a linear diameter of 1 pc, 
is typically characterized by the presence of sub-structures, 
each corresponding to individual cores, as revealed by the 
Herschel follow-up (Juvela et al., 2010,2011, 2012; Planck 
Collaboration XXII, 2011; Montillaud et al., 2014 submit¬ 
ted). The Herschel observations also highlighted that the 
Planck sources likely correspond to different evolution¬ 
ary stages, with half of the targeted fields showing signs 
of active star formation, as indicated by the presence of 
mid-infrared point sources. In addition, the Herschel high- 
angular resolution has allowed us to shed light on the fila¬ 
mentary nature of a substantial fraction of Planck clumps 
and has evidenced that, in one case out of ten, the clumps 
have a cometary shape or a sharp boundary indicative of 
compression by an external force (Juvela et al., 2012). 

As part of the first Planck data release, the sample of 
the most robust Planck detections has been already de¬ 
livered to the astronomical community. This Early Cold 
Clump sample (ECC) included 915 Planck cold clumps (at 
T < 14 K) that are distributed over the whole sky (Planck 
Collaboration VII, 2011; Planck Collaboration, 2011). We 
are now providing the entire catalogue of cold sources, i.e., 
the Planck Catalogue of Galactic Cold Clumps (PGCC), 
based on the full Planck 2014 data release over the all-sky 
and shown in Fig. 1. 

In this paper, we describe the generation and content of 
the PGCC catalogue. A detailed analysis of the cold source 
population contained in the catalogue will be presented in 
forthcoming papers. In Sect. 2 we describe the data as well 
as the source detection and extraction method. In Sect. 3 
we discuss the generation of the catalogue, including the 
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applied quality flags for the source selection, and the photo¬ 
metric measurements. In Sect. 4 we discuss the source val¬ 
idation process of the detection algorithm, based on a sta¬ 
tistical analysis. In Sect. 5 we present the different methods 
used to derive distance estimates for the clumps. In Sect. 6 
we describe the derivation of other physical properties of 
the sources such as mass and luminosity. Finally, in Sect. 7, 
we provide details on the cross-matching of the final cat¬ 
alogue with ancillary catalogues and complementary data 
sets. 

2. Source detection and photometry 

2.1. Data set 

This paper is based on the whole observing time of the 
Planck mission, corresponding to the 5 all-sky surveys. 
Here we approximate the Planck beams by using effective 
circular Gaussians (Planck Collaboration IV, 2014; Planck 
Collaboration VI, 2014). The Full-Width-Half-Maximum 
(FWHM) at each frequency channel is given in Table 1. In 
addition, in this work, we focus on the three highest Planck 
frequency channels, i.e., 857, 545, and 353 GHz, which are 
designed to cover the Galactic cold dust emission peak. 
The 217 GHz band has not been included in our analysis, 
despite being characterized by an angular resolution com¬ 
parable to the other bands, and this is for two reasons: i) 
this band is strongly contaminated by the CO /=2—> 1 emis¬ 
sion line, as described in Planck Collaboration IX (2014), 
and this contribution is expected to be significant towards 
dense regions, given their associations with molecular ma¬ 
terial; ii) the contamination by the cosmic microwave back¬ 
ground may become problematic in this band at high lat¬ 
itude, implying a complex component separation issue. 
The noise in the channel maps is assumed to be Gaussian 
with a standard deviation of 1.55xl0~ 2 , 1.49xl0~ 2 and 
1,4x 10 -3 MJy sr -1 at 857, 545, and 353 GHz, respectively 
(Planck Collaboration I, 2014). The absolute gain calibra¬ 
tion of High Frequency Instrument (HFI) maps is about 
1.2%, 6.08%, and 6.33% at 353, 545, and 857 GHz, re¬ 
spectively (see Table 6 in Planck Collaboration I, 2014). 
Further details on the data reduction, Planck frequency 
maps and the calibration scheme can be found in Planck 
Collaboration VI (2014). 

The Planck data are combined with the IRIS all-sky 
data (Miville-Deschenes & Lagache, 2005), i.e., a repro¬ 
cessed version of the IRAS data. As described in Planck 
Collaboration VII (2011) and Planck Collaboration XXIII 
(2011), the IRIS 3THz (100 pm) data have been cho¬ 
sen because they allow us to complement the Planck data. 
In fact: i) 3THz is a very good tracer of Galactic warm 
(~ 20 K) dust; ii) the emission from small grains does not 
contribute substantially at this frequency; iii) the IRIS (4.'3) 
and Planck data angular resolutions are very similar (see 
Table 1). Note that the IRAS survey coverage presents two 
gaps, which in total account for 2% of the whole sky. In 
IRIS data, these gaps were filled in by using lower angu¬ 
lar resolution DIRBE data (~ 40')- Due to the discrepancy 
in resolution between IRIS and DIRBE , these regions have 
been excluded from our analysis. Furthermore, sources de¬ 
tected by our algorithm close to the location of the gaps 
were carefully examined, since they might be contaminated 
by noisy features in the IRIS 3 THz map. 


Table 1 . FWHM of the effective beam of the IRAS and 
Planck high frequency channel maps. 


Frequency 

[GHz] 

Wavelength 

[jum] 

FWHM 

□ 

353 

850 

4.818 ± 0.024 

545 

550 

4.682 ± 0.044 

857 

350 

4.325 ± 0.055 

3000 

100 

4.300 ± 0.200 


All Planck and IRIS maps have been convolved to the 
same resolution, 5' FWHM, before performing source de¬ 
tection and extraction. 

2.2. Detection method 

To detect cold sources in the combined Planck and IRIS 
3 THz maps, we applied the CoCoCoDeT detection algo¬ 
rithm presented in Planck Collaboration XXIII (201 1), and 
described in Montier et al. (2010). This algorithm is based 
on a multi-frequency approach which exploits the specific 
colour properties of this type of sources. The detection is 
performed independently at 857, 545, and 353 GHz using 
the cold residual maps, which are built by subtracting a 
warm component from each frequency map. This warm 
component is estimated separately in each pixel by extrap¬ 
olating a warm template, i.e., the IRIS 3 THz map, to a 
given Planck frequency v, using the local average back¬ 
ground colour estimated at 3 THz and v, and computed in 
an annulus from 5' to 15' centred on the pixel. The cat¬ 
alogues obtained in each of the three Planck bands, 857, 
545, and 353 GHz, are then merged by requiring: i) a detec¬ 
tion in each band on the cold residual maps; ii) a signal-to- 
noise ratio (S/N) greater than 4 in all bands; iii) a maximum 
distance between the centres of the three detections of 5'. 
These criteria assure cross-band detection consistency as 
well as source compactness. More details can be found in 
Planck Collaboration XXIII (201 1). 

We emphasize that our method differs from classical de¬ 
tection algorithms that typically perform the detection di¬ 
rectly on frequency maps, as for instance is the case for 
the Planck Catalogue of Compact Sources (PCCS Planck 
Collaboration XXVIII, 2014). Our method allows a detec¬ 
tion in temperature : cold sources show a positive signal in 
the cold residual maps, while warm sources show a nega¬ 
tive signal. More precisely, this technique allows us to en¬ 
hance sources having a temperature lower than the local 
background. This does not automatically imply that the de¬ 
tected sources are intrinsically cold: for example, a source 
could be detected as cold simply because it is seen against a 
very warm background (or foreground). This is the typical 
case of objects located along the line-of-sight of active star 
forming regions. In building the catalogue, this effect has 
been taken into account, as discussed in Sect. 3.4. 

2.3. Photometry 

The flux density of the cold clumps has been estimated from 
the IRIS and Planck bands using the algorithm described 
in Planck Collaboration XXIII (2011). Here we recall the 
main steps of the method: i) determination of the clump 
size and position by means of an elliptical Gaussian fit of 
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the cold residual map at 857 GHz at the location where the 
S/N is the highest; ii) polynomial fit of the background sur¬ 
face at 3 THz and removal of the cold component from the 
3THz warm template before extrapolation to the Planck 
bands; iii) aperture photometry in all bands using the el¬ 
liptical profile. We provide two estimates of the flux den¬ 
sity in each band, one for the cold clump and one for its 
associated warm background. The estimate for the clump 
is based on the cold residual maps, after subtraction of the 
warm component, while the warm background flux density 
is computed from the warm component, extrapolated from 
3 THz and integrated over a solid angle with the same size 
as the cold clump. The sum of these two values gives the 
total flux density of the cold clump in the original IRIS and 
Planck maps. 

The uncertainty on the flux density estimates have been 
obtained by performing Monte-Carlo simulations. In par¬ 
ticular, for each source we inject one at a time, in an an¬ 
nulus extending from 10' to 30' centred on the source it¬ 
self and in both the IRIS and Planck cold residual maps, 
artificial sources with the same flux density and elliptical 
shape. This operation allows us to preserve the instrumen¬ 
tal noise and confusion level of the original maps. The un¬ 
certainty on the true source photometric measurement is 
then given by the standard deviation of the flux densities 
of the artificial sources when the same photometric steps 
are applied to both the true and artificial sources. We em¬ 
phasize that our flux density uncertainties are likely slightly 
conservative, as the confusion arising from injecting artifi¬ 
cial sources in the proximity of the true source generates 
additional noise and this term is included in our calcula¬ 
tions. More details can be found in Planck Collaboration 
XXIII (2011). We have performed a Monte Carlo Quality 
Assessment (MCQA) to evaluate the overall quality of our 
photometric measurements and this is presented in Sect. 4. 
As in Planck Collaboration XXIII (201 1), we assign to the 
catalogued sources quality flags to indicate the accuracy on 
the estimated flux densities and sizes, and these flags are 
used to divide the sources into three categories of increas¬ 
ing flux quality, as described in Sect. 3.1. 

Notice that a minimum distance of 5' between two 
sources was required for detections at 857, 545, and 
353 GHz. However, band-merging and elliptical Gaussian 
fitting both modify the final centroid coordinates of the 
clumps, which therefore may not longer satisfy the 5' crite¬ 
rion. Furthermore, elongated clumps may partially overlap 
even at a distance greater than 5' between them. Because 
Galactic very sources are preferentially found highly struc¬ 
tured regions of the Galaxy, where confusion is significant, 
we have to face severe blending issues. After obtaining the 
elliptical Gaussian profile of all the sources, we compute 
the overlap between a given source and all its neighbours 
located within 15'. All sources with a non-zero overlap are 
flagged (FLUX_BLENDING), and further information is pro¬ 
vided, as detailed in Sect. 3.2. 

3. Catalogue generation 

In this section, we describe the final selection of sources, 
starting from the source list generated by the detection al¬ 
gorithm. In particular, we discuss how we have increased 
the reliability of the catalogue by rejecting spurious sources 
and extragalactic contaminants. Finally, we provide details 
on the catalogue content. 


3.1. Selection based on photometric quality 

We have seen that the detection algorithm described in 
Sect. 2.2 has been applied to the combined Planck and IRIS 
3 THz data. After rejection of the spurious detections ob¬ 
tained in the proximity of the gaps in the IRAS map, we 
are left with 13832 sources. As a first validation step, we 
have used the quality of the photometric measurements dis¬ 
cussed in Sect 2.3 to identify additional spurious sources. 
Following this procedure, 428 detections (~ 3% of the to¬ 
tal) are rejected because of highly inaccurate photometry. 
The typical case of rejected sources at this stage is that of 
sources with a negative flux density estimate, which can be 
caused by the presence of stripes in the data. The 13404 re¬ 
maining sources have been divided into three categories, 
according to the quality of their flux density values (see 
FLUX_QUALITY flag). The three categories are: 

category 1 : "Reliable flux densities" : sources with flux 
densities at S/N> 1 in both Planck (857, 545, and 
353 GHz) and IRIS 3 THz bands, allowing a full char¬ 
acterization of their colour ratio and their temperature, 
as required for their validation. These sources represent 
the highest quality sample of the PGCC catalogue, and 
have FLUX_QUALITY=1. 

category 2: "Missing 3 THz flux density" : sources with 
flux densities S/N> 1 in all bands except for the IRIS 
3 THz band, where we have obtained only an upper 
limit. These sources are typically characterized by low 
flux densities and extremely cold temperatures, and 
have no detectable counterparts in the infrared. They 
are potentially interesting very cold clump candidates, 
and have FLUX_QUALITY=2. 

category 3: " Detection only" : sources for which the qual¬ 
ity of the elliptical Gaussian fit is very poor, thus no 
reliable flux density estimate can be obtained. These 
sources are likely extended or embedded in a complex 
environment. They have FLUX_QUALITY=3. 

We provide the number of sources in each category 
in the first row of Table 2. After removal of the contami¬ 
nants, such as sources affected by the presence of nearby 
hot sources (see Sect. 3.4) or extragalactic objects (see 
Sect. 3.3), we obtain the final numbers of sources shown 
in the last row of the Table. The all-sky distributions of the 
sources in each FLUX_QUALITY (hereafter FQ) category are 
shown in Fig. 2. 

3.2. Blending issue 

A total number of 1757 sources are affected by blend¬ 
ing, i.e., their flux density estimates have been compro¬ 
mised by the presence of a nearby and partly overlapping 
source. In this case, the flag FLUX_BLENDING is raised, and 
an approximate estimation of the contamination level is 
performed. This is expressed in terms of a relative bias 
(FLUX_BLENDING_BIAS) of the original flux density esti¬ 
mate in each band, when flux density estimates are avail¬ 
able for both involved sources. The median value of this 
bias is around -37%, although it varies greatly from one 
source to another. This bias is only indicative and cannot be 
used to correct the flux density estimates. A more accurate 
estimate of these flux densities could be obtained by per¬ 
forming a detailed analysis on individual sources, which, 
in particular, should take into account the local background 
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Fig. 2. All-sky distribution of the PGCC sources according to their FLUX_QUALITY category: "Reliable flux densities" 
(FQ=1, top panel), "Missing 3 THz flux density" (FQ=2, middle panel ) and "Detection only" (FQ=3, bottom panel). 
Sources located in the LMC and SMC are also shown in light colours. 


5 





























Planck Collaboration: PGCC catalogue 


fluctuations, hence the relative contribution of each source 
component to the integrated flux density. A visual inspec¬ 
tion to the ID cards of the blended sources may help to get 
an idea of the complexity of each case. For each source im¬ 
pacted by blending, we also provide the catalogue index of 
the companion source (FLUX_BLENDING_IDX) with the an¬ 
gular distance to its centroid (FLUX_BLENDING_ANG_DIST). 

3.3. Extragalactic contamination 

The goal of the PGCC catalogue is to contain a selection as 
large as possible of Galactic cold clump candidates. Since 
the CoCoCoDeT detection algorithm is applied to all-sky 
maps, it is possible to have contamination from extragalac¬ 
tic objects. Hereafter we describe the three independent 
methods that we have used to identify and reject this type 
of contaminants. 

The first step consists of a colour-colour selection of 
’radio-type’ objects, characterized by a flat spectral en¬ 
ergy distribution (SED) in the submillimetre and millimetre 
wavelength range. This kind of objects may have been de¬ 
tected by our algorithm because of the flattening of their 
SED around 857 and 545 GHz, which tends to mimic a 
cold black body spectrum. In this case, we have used the 
353 GHz to 545 GHz flux density ratio to discriminate be¬ 
tween radio-emitting and other type of objects. We found 
26 objects with a ratio S 353 /S 545 > 0.9, typical of an ex¬ 
tremely flat or increasing SED in the millimetre domain. 

The second step consists in cross-correlating the 
PGCC catalogue with extragalactic catalogues, such as: 
the Messier (Messier & Niles, 1981) catalogue, the NGC 
(Dreyer, 1888) and IC (Dreyer, 1895) catalogues of nearby 
galaxies, and the 3C (Edge et al., 1959; Bennett, 1962) 
and 4C (Pilkington & Scott, 1965; Gower et al., 1967) 
catalogues of quasars. The cross-correlation has been per¬ 
formed using a 5' radius, leading to 66 found associations 
between a cold clump and an extragalactic object. 

In the last step, we have searched for possible opti¬ 
cal counterparts in the Digitized Sky Survey (DSS) data 
(Djorgovski et al., 2003). The whole sample of cold clump 
candidates has been visually inspected to look for extended 
and/or bright sources in DSS images located close to or 
at the cold clump coordinates. After selecting some 800 
sources with potential DSS counterparts, for each of these 
we have carefully examined ancillary Dame et al. (2001) 
CO data, extinction maps obtained with the NICER algo¬ 
rithm (Lombardi, 2009) and cold residual maps. In addi¬ 
tion, we have also searched for possible counterparts in the 
Planck Low Frequency Instrument (LFI) data. The combi¬ 
nation of all these data sets has allowed us to identify 43 
sources that are likely of extragalactic origin. 

Finally, we have merged the three samples of extra¬ 
galactic contaminants and rejected 114 unique objects from 
the initial source list. 

The CoCoCoDeT algorithm has also detected sources 
in the Large Magellanic Cloud (LMC) and the Small 
Magellanic Cloud (SMC), which have not been rejected 
from the catalogue. Because of the proximity of these two 
galaxies, the Planck resolution and sensitivity allow the 
detection of individual cold clumps, forming a potential 
very interesting sample. Hence 51 Planck clumps falling 
inside a radius of 4?09 centred on the Galactic coordinates 
(279.03, -33.60), as defined by Staveley-Smith et al. (2003), 
are flagged (XFLAG_LMC) as part of the LMC, and 3 oth¬ 


ers falling inside a radius of 2?38 centred on the Galactic 
coordinates (302.67, -44.46), following Stanimirovic et al. 
(1999), are flagged (XFLAGJ3MC) as part of the SMC. 
Remarkably, follow-up observations of the LMC and SMC 
Planck clump candidates have confirmed the nature of these 
objects, as discussed in Appendix A. 

3.4. Nearby hot source contamination 

The detection algorithm CoCoCoDeT (Montier et al., 2010) 
used to extract the cold clumps from the Planck and IRIS 
data is designed to detect sources colder than a median 
background estimated in the neighbourhood of the source. 
As stressed in Sect. 2.2, this method allows us to detect cold 
regions embedded in a warm background, but it can also 
yield detections of extended envelopes of warm sources that 
appear colder than their environment but are not intrinsi¬ 
cally cold. In some extreme cases, a hot source can cause an 
overestimation of the background temperature in its prox¬ 
imity and thus lead to spurious detections. 

In order to investigate this type of contamination, we 
use the cold residual maps as an indicator of the warm 
background around the source and look for negative con¬ 
tiguous pixels. By definition, where the cold residual is 
positive, the relative temperature is colder than the back¬ 
ground, and the other way round, i.e., where the residual 
is negative, the temperature is higher. Thus we can build a 
list of hot point sources by using the same detection algo¬ 
rithm as for the cold clumps, but this time by applying it to 
the reverse of the cold residual maps. We then compute the 
minimum distance between any cold clump candidate and 
hot source detections in a 15' radius from the cold clump 
coordinates centre. This yields 2464 cold-hot associations 
with distances ranging from 2.'6 to 15'. 

We emphasize that the presence of hot sources in the 
proximity of cold sources does not lead systematically to 
spurious detections. This kind of association is expected in 
star formation regions, as both pre- and proto-stellar cores 
often reside in the same molecular cloud. In fact, the forma¬ 
tion of cold and compact condensations may even be trig¬ 
gered by nearby star formation. For this reason, we only 
reject cold clump detections that are associated with a hot 
source located inside a 5' radius from the cold clump coor¬ 
dinate centre (48 sources) and we flag the other 2416 cases 
while providing the distance between the centre of the cold 
clump and the hot source (NEARBY_H0T_S0URCE). 

3.5. Description of the catalogue content 

The final PGCC catalogue counts 13188 Galactic sources 
and 54 sources located in the LMC and SMC, divided 
into three categories of flux density quality (FLUX_QUALITY 
flag). The number of sources in each category is given in 
Table 2. The Table also provides the number of sources 
that are flagged due to the presence of nearby hot sources 
(NEARBY_H0T_S0URCE flag) or because they are located in 
the LMC and SMC. The all-sky distribution of the PGCC 
sources, overlaid on the 857 GHz Planck data, is shown in 
Fig. 1, while Fig. 2 shows the distribution per flux density 
quality flag. The columns in the catalogue and their mean¬ 
ing is given in Tables C.l and C.2 in Appendix C. The 
distance estimates and related physical properties are de¬ 
scribed in Sects. 5 and 6 . The PGCC catalogue is available 
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Table 2. Description of the PGCC content, providing the 
total amounts of initial, rejected, final and flagged sources, 
and split in each category of the flux density quality. 



FLUX.QUALITY 

Total 


1 

2 

3 


Initial. 

7062 

3833 

2509 

13404 

Extragalactic sources . . 

46 

40 

28 

114 

LMC/SMC . 

19 

25 

10 

54 

Nearby hot sources . . . 

4 

13 

31 

48 

Final . 

6993 

3755 

2440 

13188 

Flag nearby hot sources 

758 

1025 

633 

2416 

Flag blending . 

726 

528 

503 

1757 

With distance estimate . 

2940 

1686 

948 

5574 


online 2 , together with 30’x30’ cutouts, respectively, from 
the IRIS 3 THz, the Planck frequency map, and the Planck 
cold residual maps. 

4. Quality assessment 

4.1. MCQA simulations 

We have carried out a Monte Carlo Quality Assessment 
(MCQA) to quantify the performance of the detection al¬ 
gorithm applied to the combined IRIS 3 THz and Planck 
data set. To this end, we have generated all-sky simula¬ 
tions by injecting artificial sources in the IRIS and Planck 
maps, and then applied the CoCoCoDeT algorithm described 
in Sect. 2.2. In total, 150000 sources have been injected 
over the whole sky, divided into 15 sky realizations. Each 
source is characterized by a temperature, a fixed emissivity 
spectral index /3-2, a flux density at 857 GHz, and an ellip¬ 
tical Gaussian profile (major and minor axis, ellipticity and 
position angle). The simulated sources are randomly dis¬ 
tributed across the sky, using a uniform spatial distribution, 
at a minimum distance of 12' from the true Planck cold 
clump centre coordinates. 

The synthetic temperatures range from 6 to 20 K, while 
the synthetic flux densities at 857 GHz follow a uniform 
distribution in logarithmic scale between 1.5 Jy and 500 Jy, 
indicating that we inject more faint sources that are effec¬ 
tively detected. Temperatures and flux densities are inde¬ 
pendently assigned to a value, i.e., no functional relation is 
assumed between these two quantities. The ellipticity varies 
between 1 and 2, and 6 ranges from 5' to 7'. 

We note that these simulations do not intend to accu¬ 
rately reproduce the Planck cold clump properties. Their 
goal is rather to cover entirely the physically accept¬ 
able parameter space, in order to allow us to recover the 
CoCoCoDeT transfer function. 

The detection and extraction procedure, described in 
Sects. 2.2 and 2.3, is then applied to the simulated data 
set, and a catalogue of detected sources, with corresponding 
flux densities and FLUX_QUALITY flag, is built. 

4.2. Completeness 

Completeness is defined as the ratio of the number of de¬ 
tected sources to the total number of injected sources. Using 

2 ESA PGCC link 
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Fig. 3. Completeness as a function of the input tempera¬ 
ture (top panel), latitude (middle panel), and mass (bottom 
panel) of the injected sources. For the latitude and mass 
cases, the input temperature was lower than 17 K. In ad¬ 
dition, when considering the dependence of completeness 
on mass, all the sources are assumed to be at 100 pc from 
the Sun. Each panel shows the distributions obtained from 
selecting only the sources in a given FLUX_QUALITY cate¬ 
gory. In particular, FQ= 1, 2 and 3 correspond to the blue, 
green and pink curves, respectively. Finally, the dotted line 
in each panel denotes the completeness for the sample of 
sources with input flux density S 857 > 15 Jy. 

the Monte Carlo analysis described in the previous section, 
we have investigated whether the completeness of the cata¬ 
logue generated with the CoCoCoDeT algorithm depends on 
temperature. Galactic latitude, flux density and mass. 


7 































Planck Collaboration: PGCC catalogue 






Fig. 4. Completeness (computed on a restricted sample of sources with injected temperature below 17 K) as a function 
of the injected flux density in the IRAS 3 THz (top left panel) and Planck upper frequency bands, 857 GHz ( top right 
panel), 545 GHz ( bottom left panel), and 353 GHz ( bottom right panel). It is shown per-category of flux density quality, 
i.e., FQ= 1,2 and 3 categories (blue, green and pink, respectively). The completeness is also computed on a more restricted 
sample of sources with injected temperature below 17 K and with a flux density accuracy better than 50% ( dashed lines). 


The top panel of Fig. 3 shows the relation between 
catalogue completeness and temperature of the injected 
sources. The completeness is about 60% ( black solid line) 
for temperatures lower than 10 K, while it drops below 1 % 
for temperatures larger than 17 K. In addition, the com¬ 
pleteness increases to almost 90%, for temperatures below 
10 K if we consider only sources with input flux densities 
S 857 > 15 Jy (dotted line). This result confirms that 
t CoCoCoDeT is a method optimized to detect cold sources 
embedded in a warm environment, while rejecting warm 
sources. We also note that sources flagged with FQ=3, (i.e.. 
Detection only) are found in correspondence of relatively 
warm temperatures (between 12 and 18 K), indicating that, 
as discussed in Sect. 3.1, they might not be cold clumps. On 
the other hand, the completeness of the sources with FQ=2 
(i.e.. Missing 3 THz flux) increases towards lower tempera¬ 
tures, further suggesting that they are probably very cold. 
Notice that the completeness of sources with very low tem¬ 
perature (close to 6 K) is still about 60%, which means that, 
if these sources indeed exist, our algorithm is able to de¬ 
tect them. Finally, the completeness of the most reliable set 
of sources (FQ= 1, blue) appears to peak (at 40%) around 
12 K, ranging from 17 K to 6 K, i.e., the floor of the tem¬ 
perature distribution of the injected sources. Because the 
detection efficiency drops to zero beyond 17 K, in the fol¬ 
lowing we limit the discussion to simulated sources with 
temperatures lower than this threshold. 


The middle panel of Fig. 3 illustrates the complete¬ 
ness as a function of Galactic latitude. Outside the Galactic 
plane, the completeness cumulated over all FQ categories 
remains quite constant and around 60%, while it drops to 
30% for |i>| < 10°. This effect is expected and due to confu¬ 
sion in the Galactic plane. Sources with FQ= 1 and 2 present 
a similar behaviour. On the contrary, the simulated sources 
with FQ=3 are mainly detected outside the plane, as ob¬ 
served for the real PGCC sources in this flux category (see 
bottom panel of Fig. 2). 

We have also explored the dependence of the complete¬ 
ness on the injected flux density in the IRIS 3 THz and 
Planck bands. The result is in Fig. 4. The completeness of 
the FQ=3 sources increases with flux density, especially at 
the two highest frequencies. This is indeed the behaviour 
we expect from relatively warm sources. Conversely, the 
completeness of the FQ=2 sources, which are presumably 
very cold, peaks at bright flux densities in the two lower fre¬ 
quencies, and at faint flux densities (below 1 Jy) at 3 THz, 
where it has not been possible to get any output flux density 
estimates. The completeness at 3 THz of the FQ= 1 sources 
that, by definition, have S/N > 1, is about 20% below the 
IRIS sensitivity limit (1 Jy), and may appear inconsistent 
with it. This comes from the fact that completeness in a 
given band is here defined based on the input source flux 
density rather than on the output one. If we estimate the 
completeness by using only the sources for which the re- 
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Fig. 5. Density distribution of the positional offsets com¬ 
puted from 100 000 Monte Carlo realizations. Circles show 
the cumulative distributions at 68% (solid line), 90% (long- 
dashed line), 95% (dashed line) and 99% (dotted line). 

covered flux has an accuracy of 50% or more (dashed line), 
we obtain a result in agreement to within lcr with the IRIS 
sensitivity limit, i.e., it drops to 0% for flux densities below 
0.5 Jy. Interestingly, the completeness drops to 0% for S 545 
and S 353 below 1 Jy and 0.3 Jy, respectively, for all FQ cat¬ 
egories: these two bands define the detection limit of our 
catalogue. 

This flux density limit can be converted into a mass de¬ 
tection limit if we assume that all the sources are located 
at 100 pc from the Sun. The completeness as a function 
of mass is shown in the bottom panel of Fig. 3. The cat¬ 
alogue appears very incomplete for sources with a mass be¬ 
low 0.1 Mo, which explains the lower cutoff of the PGCC 
mass distribution in Fig. 11. At the same time, for sources 
with FQ=1 the catalogue is complete at the 20% - 40% 
level across the entire mass range, meaning that no mass 
selection is introduced by CoCoCoDeT. Therefore, the only 
(significant) bias introduced in the PGCC mass distribu¬ 
tion originates from the availability of distance estimates. 
Finally, the simulated FQ=2 detections are characterized 
by a higher degree of completeness for relatively high mass 
values, as also observed also for the PGCC sources. 

4.3. Geometric accuracy 

By geometry, we mean the ensemble of parameters (e.g., 
centroid, ellipticity, position angle and equivalent full width 
half maximum) that describes the location, size and orienta¬ 
tion of the source. The accuracy of these parameters is cru¬ 
cial for accurate photometric measurements, as described 
in Sect. 2.3. 

Figure 5 summarizes the catalogue positional accuracy, 
which we define as the offset between the input and the 
recovered centroid of the synthetic sources. The cumula¬ 
tive distributions are shown at four confidence levels: 68%, 
90%, 95% and 99%. 68% of the sources have a recovered 
centroid with a 0.'2 uncertainty, and 95% with a 0.'8 uncer- 
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Fig. 6. Ratio of the recovered (OUT) to injected (IN) geo¬ 
metric parameters for the simulated sources detected with 
FQ= 1 and 2 : 6 (top), position angle ( middle) and ellipticity 
(bottom). The ratio is given as a function of the detection 
S/N estimated from the cold residual maps. The light and 
dark grey shaded regions (denoted by the dot-dashed and 
long-dashed lines, respectively) highlight the behaviour of 
95% and 68% of the sources, respectively, in each S/N bin. 
The median of the ratio distributions are shown with a solid 
line. The 0%, 10% and 50% levels of uncertainty are over¬ 
laid using a red solid, dashed and dotted line, respectively. 

tainty. The median of the position offset distribution is 10". 
Thus, despite background confusion, our catalogue appears 
to contain accurate source coordinates. 

We have also checked the accuracy for the other geo¬ 
metric parameters. For this purpose, we have investigated 
the relation between the ratio of the recovered to injected 
quantity (OUT/IN) and the S/N. The result is illustrated in 
Fig. 6. 

For each bin in S/N, we have computed the cumula¬ 
tive OUT/IN distributions and corresponding median (solid 
line): in Fig. 6, each panel shows 68% (light shaded con¬ 
tours) and 95% (dark shaded contours) of the sources at 


9 








































Planck Collaboration: PGCC catalogue 


a given frequency. The uncertainty on the recovered posi¬ 
tion angle (Fig. 6, middle panel) and ellipticity (Fig. 6, bot¬ 
tom panel ) is very good, and below 10% for 68% of the 
sources at S/N > 6. When we include 95% of the sources at 
S/N > 6, the uncertainty varies between 20 and 30%. Note 
that for S/N >10, the uncertainty on the recovered parame¬ 
ters remains fairly constant up to very high S/N. The recon¬ 
structed size of the sources, 6 , (Fig. 6, top panel) appears 
more uncertain at all S/N, with an uncertainty of about ±15 
to 20% for 68% of the data at S/N > 6, and +30% of uncer¬ 
tainty when we include 95% of the data. More importantly 
the recovered 9 is systematically underestimated by about 
10% compared to the injected one, due to background. This 
effect has a direct impact on the photometric accuracy, as 
discussed in the following section. 

4.4. Photometric accuracy 

Following a procedure similar to the one applied in the 
case of the geometric parameters, we have estimated the ac¬ 
curacy of the photometric measurements. Accordingly, we 
have computed the ratio of the recovered (OUT) to injected 
(IN) flux densities and analysed its behaviour as a function 
of S/N. We have limited the analysis to sources with FQ= 1 
at 3 THz and FQ= 1 and 2 in the Planck bands. 

The poorest performance appears to be in the 3 THz 
band. The S/N has to be as large as 15 for the accuracy 
to reach a level of about 50% for 68% of the data, while 
the uncertainty is higher by a factor 4 if we include 95% of 
the distribution. At lower S/N (< 15), the OUT/IN ratio can 
reach a factor of 10, while, by definition of FQ=1, it should 
be below 2, because of the S 9^1/ctcout > 1 contraint. Thus, 
the uncertainty at 3 THz is severely underestimated during 
the photometry measurement. 

If we compute the ratio of the difference between the 
recovered and injected flux density to flux density uncer¬ 
tainty, (SggJ - S "J,,, )/<r s ouT, we find a mean value of 5. We 
interpret this result as due to the fact that, at this frequency, 
the performance of the photometric measurements is domi¬ 
nated by modeling uncertainties, such as the removal of the 
warm background. However, this effect does not introduce 
any bias on the flux density measurements at 3 THz. 

The photometric accuracy of the photometry in the 
Planck bands (second, third and fourth panel from the top 
in Fig. 7) is much better than at 3 THz. Indeed the uncer¬ 
tainty goes down to less than 10% for 68% of the sources at 
S/N > 6, once the data are bias-corrected. In fact, the flux 
densities in the Planck bands are systematically underesti¬ 
mated, i.e., by 10% at low S/N and by 5% at high S/N, as 
a consequence of the underestimation of 6 (see Sect. 4.3), 
a well-known effect also studied in the framework of both 
the ERCSC Planck Collaboration VII (2011) and the PCCS 
Planck Collaboration XXVIII (2014); Planck Collaboration 
XXVI (2015). This bias is about at the same level as the flux 
density uncertainty, thus it is included in the 1 cr uncertainty. 

5. Distance estimates 

In this section we describe the derivation of the distance es¬ 
timates for the PGCC sources which have been obtained by 
using four different methods: i) cross-checking with kine¬ 
matic distance estimates already available, ii) using the op¬ 
tical or near-infrared extinction due to the PGCC sources as 



SNR 


Fig. 7. Ratio of the recovered (OUT) to injected (IN) flux 
densities for the simulated sources detected with FQ=1 and 
2. The ratio is given as a function of the detection S/N es¬ 
timated from the cold residual maps. The light and dark 
grey shaded regions (denoted by the dot-dashed and long- 
dashed lines , respectively) highlight the behaviour of 95% 
and 68% of the sources, respectively, in each S/N bin. Note 
that at 3 THz (top panel ) we only consider sources with 
FQ= 1, and the corresponding contours and shaded regions 
are highlighted in colour. The median of the ratio distri¬ 
butions are shown with a solid line. The 0%, 10% and 50% 
uncertainty levels are overlaid using a red solid, dashed and 
dotted line, respectively. 
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an indicator of their distance, iii) associations with known 
molecular complexes, iv) estimates from the literature. 

5.1. Kinematic 

The Simon et al. (2006a) and Jackson et al. (2008) cata¬ 
logues of infrared dark clouds (IRDC) provide kinematic 
distances for 497 objects. These distances are obtained by 
combining the gas observed radial velocity with a Galactic 
rotation curve, in the assumption of gas circular motion. 
Accordingly, an observed radial velocity at a given longi¬ 
tude corresponds to a unique Galactocentric distance so¬ 
lution while, at least in the inner Galaxy, two heliocen¬ 
tric distances are allowed. By cross-correlating the PGCC 
sources with the IRDC catalogues in a 5' radius, we have 
found 92 associations, mainly located along the Galactic 
plane. To these sources we have assigned the distance flag 
DIST_KINEMATIC. We note that, when two heliocentric so¬ 
lutions are available, we always choose the near solution. 
An arbitrary 25% uncertainty on these distance estimates is 
adopted. In the following, we will refer to kinematic dis¬ 
tances as method [1]. 

5.2. Optical extinction 

Distances derived from optical extinction are based on 
processing of two independent Sloan Digital Sky Survey 
(SDSS) photometry-based data sets containing the com¬ 
puted distances and interstellar reddening to each star. The 
first data set, based on SDSS DR7 photometry and cov¬ 
ering 1467 PGCC sources, is that of Berry et al. (2012) 
who fit reddening and the stellar locus colours of Covey 
et al. (2007) to the observed photometry. The second data 
set, from Me Gehee (2015 in preparation), makes use of 
the SDSS DR9 catalogue. This targets a total of 1769 cold 
clumps and computes the reddening from g - i colour ex¬ 
cess, where the intrinsic stellar g - i colour is derived from 
the reddening invariant indices defined by McGehee et al. 
(2005). 

Distance moduli values and uncertainties for each cold 
clump are inferred from analysis of 100 Monte Carlo runs 
using stars within a 10' radius of the catalog position. For 
each realization the distance moduli and E(B-V) reddening 
of each star are randomly varied assuming normal distribu¬ 
tions of N{ 0,0.282 2 ) and N{ 0,0.072 2 ), respectively. These 
variance values, which are similar for both data sets, were 
computed by propagating the observed stellar locus width 
in low extinction (A,. < 0.05) regions and the stated photo¬ 
metric uncertainties through the relations for E{B - V) and 
m — M. 

Each of these profiles are processed by a Canny edge 
detection filter (Canny, 1986) with the location of the cold 
clump set by the distance modulus for which the edge de¬ 
tection signal is maximized. A Gaussian sigma of 0.3 mag¬ 
nitudes in m - M is used in the Canny filter for smooth¬ 
ing and noise reduction. Implicit in this approach is the as¬ 
sumption that there is only a single interstellar cloud along 
the line-of-sight. For each PGCC source we assign a value 
and uncertainty to the distance modulus based on the mode 
and standard deviation of the distribution. This distribution 
is obtained via kernel density estimation [KDE] on a 0.01 
magnitude grid using the values returned from the Monte 
Carlo realizations and with the bandwidth of the KDE set 
by the normal distribution approximation. 


We adopted the following prescriptions of Me Gehee 
(2015 in preparation) to build a final selection of robust 
distance estimates: i) distance estimates obtained with the 
M dwarf based technique (Me Gehee, 2015 in prepara¬ 
tion) towards sources with an extinction E(B - V) < 0.4 
are rejected, and ii) distance modulus estimates with uncer¬ 
tainties larger than 1.0130 and 0.7317, for the Berry et al. 
(2012) or Me Gehee (2015 in preparation) methods, respec¬ 
tively, are also rejected. Furthermore we have performed a 
sanity check on the altitude of the sources, rejecting those 
with an altitude above or below 2 times the Galactic scale 
height, which has been recently estimated by Jones et al. 
(2011) at 119±15pc. 

Two sets of distance estimates are finally provided 
using optical extinction with SDSS data, depending on 
the SDSS data version. Hence 1083 sources have been 
assigned a distance estimate based on the DR7 SDSS 
data version using the Berry et al. (2012) method [2] 
(distance flag DIST_0PT_EXT_SDSS_DR7) and 191 sources 
based on the DR9 SDSS data version using the Me 
Gehee (2015 in preparation) method [3] (distance flag 
DIST_0PT_EXT_SDSS_DR9). 

5.3. Near-infrared extinction 

By comparing observed stellar colours to the predictions of 
the Besangon Galactic model (Robin et al., 2003; Robin 
et al., 2012), we have attempted to infer the most prob¬ 
able three-dimensional extinction distribution along the 
line-of-sight. The line-of-sight extinction is parametrised 
using a number of points, each described by a distance 
and an extinction. These parameters are probed using a 
Markov Chain Monte Carlo (MCMC) method, based on 
the Metropolis Hastings algorithm with 10 5 iterations. 
Modeled stars are reddened using linear interpolation be¬ 
tween points and comparison with observations is per¬ 
formed on the colour distribution using a Kolmogorov 
Smirnov test. 

Stars are chosen from the Two Micron All Sky Survey 
(2MASS, Skmtskie et al., 2006) point source catalogue 
that lie within the ellipse defining the cold clump, so in the 
location of the cloud along the line-of-sight should be de¬ 
tectable as a sharp rise in extinction. The resultant extinc¬ 
tion vs distance profile is then analysed to detect the pres¬ 
ence of any clouds. The dust density with respect to dis¬ 
tance is calculated via the derivative of the extinction dis¬ 
tance relation and the diffuse extinction is estimated from 
the continuum. Any peaks in dust density 3 cr over the dif¬ 
fuse are flagged. If a line-of-sight contains more than one 
cloud, the one with the highest extinction is chosen and 
the presence of a second cloud is flagged. Only lines-of- 
sight with a single detected cloud have been included in the 
present PGCC catalogue. 

The principle of the method is similar to that described 
by Marshall et al. (2006) and Marshall et al. (2009). The 
motivation for the change in our case is to provide more 
robust estimates when stellar density is low, as well as to 
more fully characterize the uncertainty via the MCMC ex¬ 
ploration of the parameter space. 

The former version of this method was first applied in 
Marshall et al. (2009) on the Simon et al. (2006a) MSX 
catalogue of IRDCs, providing 1218 distance estimates. 
After cross-correlating the PGCC sources with this IRDC 
catalogue using a 5' radius, we have found 182 associa- 
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tions, leading to the distance estimates of the method [4] 
(DIST_NIR_EXT_IRDC field). We have adopted an uncer¬ 
tainty for the distance estimate of 25% following Marshall 
et al. (2009) prescriptions. The improved algorithm has 
been then applied to the PGCC sources using 2MASS data. 
After performing the sanity check on the altitude (as for the 
optical extinction methods), and including 787 upper limits 
(marked as negative estimates), this has led to 2810 esti¬ 
mates (method [5], DIST_NIR_EXT field). The uncertainty 
on these new set of distance estimates is provided individu¬ 
ally by the algorithm. 

5.4. Molecular complexes 

A simple inspection of the all-sky distribution of cold 
clumps (see Fig. 1) suggests that it follows the distribution 
of known molecular complexes at intermediate latitude, as 
it is illustrated in Appendix B. Many of these molecular 
complexes have distance estimates in the literature. To as¬ 
sign the distance of a complex to a particular cold clump, 
we have used the all-sky CO Planck map. In particular, we 
have checked the presence of a given PGCC source inside 
a molecular cloud, by using a mask generated from the CO 
map. This method has been applied to 11 molecular com¬ 
plexes listed in Table 3 and located outside the Galactic 
plane, which allows us to minimize the effect of confusion. 
Following this procedure, we have obtained 1895 distance 
estimates (method [6], DIST_M0LECULAR_C0MPLEX field) 
with associated uncertainties. 

5.5. Herschel follow-up 

As mentioned in the introduction and further discussed 
in Sect. 7 we have performed a high angular resolution 
follow-up with the PACS and SPIRE instruments on-board 
Herschel, in the framework of the Herschel key-programme 
Galactic Cold Cores (HKP-GCC). This follow-up pro¬ 
gramme has targeted 349 PGCC sources, for which we have 
obtained distance estimates from the literature (Montillaud 
et al., 2014 submitted), the most reliable of which (228 
sources) are reported in the final catalogued and flagged 
as DIST_HKP_GCC field. In the following, we refer to this 
method as method [7]. 

5.6. Combined distance estimates 

We rank all methods by increasing order of confidence 
level, starting with the kinematic estimates ([1]), which sev¬ 
eral authors indicate as less reliable than extinction esti¬ 
mates (Foster et al., 2012), due to the distance ambiguity 
in the inner Galaxy. The estimates derived from optical ex¬ 
tinction are ordered as [2] then [3], according to the SDSS 
data release version (DR7 to DR9). The estimates obtained 
from the near-infrared extinction come next, starting from 
those obtained towards IRDCs ([4]), and then considering 
the ones from the improved algorithm ([5]). The associa¬ 
tion with molecular complexes ([6]) provides consistent es¬ 
timates for sources belonging to the same cloud. Finally, the 
distance estimates derived from the analysis of the Herschel 
observations appear as the most reliable ([6]). 

In the final catalogue we have assigned to each source 
a unique distance value (DIST) , corresponding to the dis¬ 
tance estimate with the highest confidence level among the 
available estimates. This approach allows us to avoid as- 


Table 3. Molecular complexes used to assign a distance es¬ 
timate to the PGCC sources. For more details see Sect. 5.4. 


Name 

1 

[deg] 

b 

[deg] 

Area 

[deg 2 ] 

Distance 

[pc] 

No. 

Aquila Serpens . 

28 

3 

30 

260 

± 

55 

[1,2] 

51 

Polaris Flare . . . 

123 

24 

134 

380 

+ 

40 

[3] 

68 

Camelopardalis . 

148 

20 

159 

200 

+ 

30 

[3] 

19 

Ursa Major .... 

148 

35 

44 

350 

+ 

35 

[3] 

22 

Taurus . 

in 

-15 

440 

140 

+ 

15 

[2] 

384 

Taurus Perseus . . 

163 

-15 

440 

230 

+ 

20 

[2] 

224 

A Ori. 

196 

-13 

113 

400 

+ 

40 

[4] 

70 

Orion. 

212 

-9 

443 

450 

+ 

50 

[2] 

333 

Chamaeleon. . . . 

300 

-16 

27 

170 

+ 

15 

[5] 

114 

Ophiuchus. 

355 

17 

422 

150 

+ 

5 

[2] 

316 

Hercules. 

45 

9 

35 

300 

+ 

75 

[6] 

19 


PI Bontemps et al. (2010); ^ Loinard (2013); ® Schlafly et al. (2014); 
PP Murdin & Penston (1977); ® Bertout et al. (1999); ® Andersson 
et al. (1991). 


signing to a source an average distance computed from 
individual estimates obtained from very different meth¬ 
ods. Importantly, we have checked the internal consistency 
among the available estimates for a given clump using the 
following relation: 



where n is the number of available distances, D, and Dj are 
the distance estimates and their corresponding standard de¬ 
viation <t, and (T j , and C 2 is the number of combinations of 
pairs between the n estimates. A value X» < 1 indicates that 
the mean distance between distance estimates is compati¬ 
ble with a 1 cr uncertainty on each estimate. We have used 
the Xo parameter to assign a DIST_QUALITY flag to each 
source. In particular, Xo can take up the following values: 

0. "No estimate" : no distance estimate available; 

1. "Consistent" : few estimates are available and consistent 

within lcr (Xo < 1); 

2. "Single" : one single estimate; 

3. "Unconsistent" : few estimates are available, but not 

consistent within lcr (Xo > 1); 

4. "Upper limit" : only an upper limit is available. 

Following the procedure described above, we have ob¬ 
tained a total of 5574 distance estimates distributed across 
the whole sky with an associated DIST_QUALITY flag. The 
method selected for each distance estimate (DIST) is spec¬ 
ified with the DIST_0PTI0N field, which ranges from 0 to 
7 (see Table 4). The statistical properties of the distance 
distribution are shown in Table 4. We emphasize that these 
distance estimates have been compiled without taking into 
account the quality of the photometric measurements. In to¬ 
tal, there are 4655 PGCC sources with DIST.QUALITY flag 
equal to 1 and 2. Of these, only 2489 sources have reliable 
flux densities in all bands (FQ=1), while 1378 sources have 
reliable flux densities in the Planck bands, but not at 3 THz 
(FQ=2). For more details see Table 5. 

The cross-correlation between the DIST_QUALITY and 
FLUX_QUALITY flags is shown in Table 5, and reveals that 
only 2489 sources have both reliable distance estimates 
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Fig. 8. All-sky distribution of the 4655 PGCC sources for which a distance estimate with a DIST_QUALITY flag equal to 
1 or 2 is available. The various types of distance estimates are defined as follows : kinematic (purple), optical extinction 
(blue), near-infrared extinction (green), molecular complex association (orange), and Herschel HKP-GCC (red). We also 
show the distribution of the 664 sources with an upper-limit estimate (DIST_QUALITY=4) provided by the near-infrared 
extinction method in light green. The regions covered by the molecular complexes are shown as black contours (see 
Table 3). 



Distance [kpc] 

Fig. 9. Distance distribution per type: kinematic, optical ex¬ 
tinction, near-infrared extinction, molecular complex asso¬ 
ciation, and Herschel HKP-GCC. 

DIST_QUALITY=1 or 2) and reliable flux densities in all 
bands (FQ=1), while 1378 sources have reliable flux densi¬ 
ties in Planck bands but not at 3 THz (FQ=2). 

The spatial distribution of the sources having a distance 
estimate (see Fig. 8) strongly depends on the adopted dis¬ 
tance method. For example, sources with distances derived 
from optical extinction (blue) mainly follow the SDSS sky- 
coverage, while distances obtained from associations with 
molecular cloud complexes present a patchy distribution, 
mirroring that of the host clouds (orange). The near-infrared 
extinction estimates (green) are mostly concentrated to¬ 
wards the inner Galactic plane where the density of stars 
is sufficient to provide high extinction contrast, while the 
near-infrared extinction upper-limit estimates (light green) 
are spread at larger latitudes for the same reason. 



x [ kpc] 


Fig. 10. Distribution of the PGCC sources as seen from 
the North Galactic pole. Only distance estimates with a 
DIST_QUALITY flag equal 1 and 2 are plotted. Different 
methods have been used to derive the distances: kine¬ 
matic (purple), optical extinction (blue), near-infrared ex¬ 
tinction (green), molecular complex association (orange), 
and Herschel HKP-GCC (red). We also show the distribu¬ 
tion of the distance upper-limits (DIST_QUALITY=4) pro¬ 
vided by the near-infrared extinction method (light green). 
The red dashed circle shows the 1 kpc radius around the 
Sun. Black dashed lines represent the spiral arms and the 
local bar. The black circles, centred on the cross, provide 
an indication of the location of the molecular ring. 

A similar type of consideration applies to the analy¬ 
sis of the statistical distribution of all distance estimates 
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Table 4. Number of sources with a distance estimate for 
each of the seven methods, before (column 2) and after 
(column 3) the combination process described in Sect. 5.6, 
in which only one distance is selected among the ones 
available. The repartition between methods of the final dis¬ 
tance estimates provided in the catalogue (DIST field) is 
shown in columns 4 to 7 for each category of the distance 
quality (DIST_QUALITY flag) : Consitent (1), Single (2), 
Unconsistent (3), Upper limit (4). 


Method 

No 

Init. Final 

1 

DIST.QUALITY 

2 3 

4 

[1] Kinematic . . . 

92 

23 

- 

23 

- 

- 

[2] Opt. Ext. DR7 

1083 

719 

- 

717 

2 

- 

[3] Opt. Ext. DR9 

191 

173 

63 

89 

21 

- 

[4] NIR Ext. IRDC 

182 

106 

19 

81 

6 

- 

[5] NIR Ext. 

2810 

2491 

138 

1601 

88 

664 

[6] Complexes . . 

1895 

1834 

177 

1576 

81 

- 

[7] HKP-GCC . . . 

228 

228 

67 

104 

57 

- 

Total . 


5574 

464 

4191 

255 

664 


Table 5. Number of sources with a distance estimate in each 
category of the DIST.QUALITY and FLUX.QUALITY flags. 


DIST.QUALITY FLUX.QUALITY 



1 

2 

3 

1 

240 

146 

78 

2 

2249 

1232 

710 

3 

116 

88 

51 

4 

335 

220 

109 


(see Fig. 9). The near-infrared extinction and kinematic 
methods allows us to probe distant regions (from 1 kpc to 
9kpc) across the Galactic plane. On the contrary, the op¬ 
tical extinction and molecular complex associations meth¬ 
ods are applicable in the nearby Galaxy only (up to 1 kpc 
and 0.5 kpc, respectively). As seen from the North Galactic 
pole (Fig. 10), the complementarity of the different meth¬ 
ods shows clearly. About 88% of the sources with a reli¬ 
able distance estimate (DIST_QUALITY=1 or 2) lie within 
2 kpc from the sun. Therefore, the PGCC catalogue mainly 
probes the solar vicinity. It is interesting to notice that the 
distribution of the PGCC sources at larger distance fol¬ 
lows at first order the Galactic arms and the molecular ring. 
This is especially significant for the Perseus arm towards 
the outer Galaxy and for the Scutum-Centaurus and Norma 
arms in the inner Galaxy. We conclude this section by em¬ 
phasizing that, due to the variety of distance estimators, any 
statistical analysis involving distances or the related quanti¬ 
ties, will be affected by severe biases which, given the fact 
that the catalogue is not flux density complete, are very hard 
to quantify. 

6. Physical parameters 

For the Planck Galactic cold clumps we have derived: tem¬ 
perature, column density and, when distance estimates are 
available, size, mass, mean density and luminosity. The 
propagated uncertainties on the computed quantities are ob¬ 
tained, for each clump, from 10 6 Monte-Carlo simulations 
and are provided in two different fashions: i) as lcr standard 


dispersion; ii) as defined by the lower and upper limits of 
the 68%, 95% and 99% confidence intervals of the Monte 
Carlo distributions. The latter takes into account the non 
gaussian behaviour of the same Monte-Carlo distributions. 

Figure 11 illustrates the temperature, column density, 
size, mass, mean density and luminosity distributions for 
the PGCC sources with FLUX.QUALITY equal to 1 and 2. 
We recall that sources with FQ=1 have reliable flux den¬ 
sities in both the IRIS 3 THz and Planck bands, which al¬ 
lows the derivation of reliable temperatures and emissivity 
index, while sources with FQ=2 are likely faint and cold, 
and their temperatures estimated are obtained using only 
the three Planck bands and a fixed emissivity spectral index 
of 2. In the case of physical quantities for which a distance 
estimate is needed, we also require that DIST.QUALITY is 
either 1 or 2. 

Additionally, we investigated the impact of the individ¬ 
ual uncertainties estimated above on the overall source cat¬ 
alogue distribution for each physical quantity. To this end, 
we have again used a Monte-Carlo approach. In this case, 
for each quantity, we build 10000 synthetic samples with 
the same number of sources as the original sample. Then, 
starting from the computed values, we add random noise 
based on the uncertainty associated with each source. At 
this stage, we generate the lcr upper and lower contours for 
each of the 10 000 samples, where the lcr limit is estimated 
based on the median of the distribution. Fig. 11 shows, as 
expected, that the contours follow reasonably well the dis¬ 
tribution of the computed values when the uncertainties are 
relatively small. On the contrary, there is quite a significant 
departure from these when the uncertainties are large. 

6.1. Temperature and emissivity 

The source temperatures and the local warm background 
temperatures have been estimated using the flux density 
measurements and their corresponding uncertainties in the 
IRIS 3 THz band and the Planck 857, 545, and 353 GHz 
channels (see Sect. 2.3). The fits of the SEDs have been 
performed assuming that the emission can be described as 
a modified black body: 

F v = F v flB v (T)(y/vof , (2) 

where F v are the observed flux densities, [i is the emissiv¬ 
ity spectral index, T is the fitted colour temperature, and 
F v o is the fitted flux density at a reference wavelength vo. 
In Eq. 2 we have assumed that the observed emission is 
optically thin at frequencies v < 3 THz, that the emissiv¬ 
ity spectral index is constant within the fitted wavelength 
range, and that the source is isothermal. The first two as¬ 
sumptions likely hold, however, the sources are not neces¬ 
sarily isothermal. In particular, the temperature is expected 
to vary along the line-of-sight due to radiative transfer ef¬ 
fects, with lower values in the inner, denser part of the 
clump than the averaged colour temperature obtained from 
Eq. 2. The derived column density and mass can then be 
underestimated up to a factor 3 (Ysard et ah, 2012). For 
each source, two independent temperature estimates have 
been obtained, i.e., by fixing the emissivity spectral index, 
jS, to 2.0, or by letting [3 be a free parameter. These esti¬ 
mates and their uncertainties have been calculated with a 
MCMC approach, using a Bayesian formulation with flat 
priors distributions for the temperature, emissivity spectral 
index (when this is a free parameter), and the amplitude 
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of the fitted modified black body (see Juvela et al., 2013, 
and references therein). The allowed parameter ranges are: 
5.0 < T < 30K, 0.5 </3 < 5.0 and F vfi > 0.0. 

The MCMC calculations have been made in chunks of 
10 7 iterations. After reaching convergence (following the 
initial burn-in phase), the parameters have been estimated 
from the final 5x 10 7 steps. The parameters values quoted in 
the catalogue correspond to the mean and the marginalised 
68% confidence intervals calculated from the MCMC sam¬ 
ples. The uncertainty of the MCMC temperature estimate 
caused by the finite length of the MCMC chain is always 
negligible. 

For sources with FQ=1, we have compared the tempera¬ 
ture and emissivity spectral index obtained from x 1 fits and 
from the MCMC results, and these are on average within 
1%. For sources with the lowest S/N, the x 1 method tends 
to give higher T values and lower /3 values than the MCMC 
technique. This is consistent with the behaviour found in 
simulations where, when [3 is treated as a free parameter, the 
joint probability distribution of T and /3 is asymmetric and 
presents a long tail towards low temperatures and high val¬ 
ues of the emissivity spectral index (Juvela & Ysard, 2012; 
Shetty et al., 2009). The standard deviations of the MCMC 
vs^ 2 estimates are cr(T) = OAK and cr(J3) = 0.08. Again, 
these numbers reflect mostly the behaviour of sources with 
the lowest S/N measurements. In general, the difference be¬ 
tween the MCMC andyA estimates are not significant when 
compared to the uncertainties. The MCMC results have also 
been compared to^ 2 fits performed over a regular grid with 
a step AT = 0.2 K and A/3 = 0.05, where, at each grid posi¬ 
tion, we fit only the amplitude of the model spectrum. The 
results are consistent within the uncertainties, confirming 
that MCMC has correctly localised the absolute minimum 
across the entire allowed parameter space. 

We stress that the temperature estimates of the sources 
correspond to the temperature of the clump-only after re¬ 
moval of the warm background, while the colour temper¬ 
ature is usually associated with the total emission on the 
line-of-sight. The resulting clump temperature distribution 
is shown in Fig. 11 for the two categories of sources, with 
FQ=1 and 2. When the emissivity spectral index [3 is al¬ 
lowed to vary (solid lines), the temperature of the Planck 
cold clump candidates with FQ= 1 ranges from 8.6 to 30 K, 
and actually from 10.5 to 19.9 K when excluding the 2% 
extreme percentiles, with a peak of the distribution at about 
14.5 K. Likewise, when we take [3- 2, the distribution is 
narrower (ranging from 6 to 22.5 K, and from 11.1 to 
16.8 K when excluding the 2% extreme percentiles) but still 
peaks around 14 K. These values are about 1 K higher than 
the temperature estimates derived in Planck Collaboration 
XXIII (2011). This can be ascribed to the change in calibra¬ 
tion of the Planck high frequency bands (857 and 545 GHz) 
with respect to the data used for the Planck Early Papers 
and the Planck Results 2013 (see Planck Collaboration 
VIII, 2014). The temperature distribution of sources with 
FQ=2 has a peak at lower temperatures, around 13K, and 
spans a range of lower temperatures, down to 5.8 K, while it 
ranges from 8.6 to 22.3 K when excluding the 2% extreme 
percentiles. This is consistent with the aforementioned hy¬ 
pothesis that these sources are colder than sources with 
FQ=1. 

If we compare this actual distribution with the expected 
temperature completeness of Fig. 3, we notice that the 
temperature distribution of the PGCC sample with FQ= 1 


drops to zero below 9 K, while the expected detection effi¬ 
ciency is still about 15% below 8 K for the same category. 
Considering now sources with FQ=2, about 0.5% of the 
sample exhibits temperature below 8 K, but no sources be¬ 
low 5.8 K, where the expected completeness reaches almost 
45% for the same category. The lower temperature limits at 
8.6 and 5.8 K for sources with FQ= 1 and 2, respectively, 
appear therefore as physical thresholds, rather than a bias 
introduced by the detection algorithm. However, it is im¬ 
portant to keep in mind that colder sources (below 6 K) may 
still exist on smaller angular scales. 

6.2. Size 

The geometric parameters of the PGCC sources are derived 
from the photometric measurements by means of an ellip¬ 
tical Gaussian fit to the 857 GHz cold residual map with 
an elliptical Gaussian. From the fit, we can estimate: the 
source centroid, its major and minor axis, ellipticity and po¬ 
sition angle. In the catalogue, these quantities are available 
only for sources with FQ=1 or 2. 

The distribution of 6, defined as the geometric mean of 
the major and minor FWHM, 6 — ^0„ vdi 0 mm , is shown in 
Fig. 11. The PGCC sources have an average 6 of 7'5 (red 
dashed line), thus appearing slightly extended with respect 
to the effective beam, Qpsf, at 857 GHz (4.325+0(055, 
Planck Collaboration I, 2014). We have also computed the 
intrinsic size, 0,, of the sources by deconvolution of the ob¬ 
served diameter from the effective beam, and yielding an 
average value of 5(6. We have obtained an average Oj/OpsF 
ratio of 1.4, which is consistent with other similar ob¬ 
servations, e.g., by BLAST (1.1, Netterfield et al., 2009) 
and Bolocam (1.5, Enoch et al., 2007). This effect might 
have to do with the hierarchical structure of the interstellar 
medium, and to the fact that cold clumps are likely to be 
characterized by the presence of extended envelopes. Note 
that, statistically speaking, PGCC sources with FQ=1 or 2 
have comparable 6 distributions. 

The ellipticity of the sources is defined as: 



As shown in Fig. 11, PGCC clumps exhibit a distribu¬ 
tion of the ellipticity peaking around 0.9 with a median 
value of 0.83, larger than the average ellipticity of the 
effective beam at 857 GHz, which is about 0.70 (Planck 
Collaboration I, 2014). While the uncertainty on the ma¬ 
jor and minor FWHM determination can lead to an artifi¬ 
cial elongation of the sources, we estimate that a 45% level 
of uncertainty is required to produce a median value of the 
ellipticity of 0.83 instead of 0.7, whereas the effective accu¬ 
racy of the recovered ellipticity is about 10% (see Sect. 4.3). 
Therefore, the observed elongation of the PGCC sources 
cannot be explained by noise only. Furthermore, we have 
verified that the position angle of the PGCC sources is not 
correlated with the Planck scanning strategy. Indeed, for the 
previous catalogue version, particularly the ECC (Planck 
Collaboration VII, 201 1), a statistical analysis revealed that 
the orientation of the sources was slightly spatially corre¬ 
lated with the scan direction. Such an effect was induced 
by the instrument transfer function, which at the time was 
not fully characterized. This issue has been resolved in the 
public release of Planck data (see Planck Collaboration VII, 
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Fig. 11. Distribution of the physical properties of the PGCC objects with FQ=1 (blue) and FQ=2 (green). The coloured 
shaded regions provide an estimate of the impact of the individual uncertainties on the statistical distribution at a lcr dis¬ 
persion level around the mean value obtained from 10000 MC realizations (see Sect. 6 for more details). The temperature 
distribution is shown for two temperature estimates, i.e., using a free emissivity spectral index [S ( solid line) or a fixed 
/3-2 ( dot-dashed line). The distribution of the intrinsic size, after deconvolution, is shown on the top right panel in dot- 
dot-dot-dashed line. The distributions of the physical size, mass, mean density and luminosity of the clumps are provided 
for sources with reliable distance estimates (DIST_QUALITY=1 or 2). These distributions are also shown separately for 
sources at solar distances smaller or larger than 1 kpc from the Sun, in dashed and dotted lines , respectively. 

16 






















































































Planck Collaboration: PGCC catalogue 


2014) used to build the present catalogue. About 40% of 
sources have 0.87 < s < 0.95, which translates to ax¬ 
ial ratios between 2 and 3. PGCC clumps appear therefore 
clearly elongated, which may be due to their association 
with filamentary structures. 

For the sources with a reliable distance estimate 
(DIST_QUALITY=1 or 2), i.e., 2489 and 1378 sources with 
FQ= 1 and 2, respectively, we have also derived their linear 
size. A few things need to be noted regarding the linear size 
distribution shown in Fig. 11. First of all, this distribution 
is limited to objects for which a distance estimate is avail¬ 
able, and these constitute, as discussed in Sect. 5, a very 
heterogeneous sample of sources. Secondly, this distribu¬ 
tion is strongly influenced by the Planck angular resolution 
(5'), therefore small (< 1 pc) and compact clumps are likely 
local objects (D < 1 kpc), while larger objects (a few pc) 
are likely intrinsically extended structures located at large 
distances (D > 1 kpc). 


6.3. Column density 


The column density of the Planck cold clumps has 
been evaluated following the prescription in Planck 
Collaboration XXIII (201 1): 


Nh 2 = 


s'jn 

pm n B v (T)k v ’ 


(4) 


where S ' is the flux density per beam at the frequency v, 
which is integrated over the solid angle of the clump defined 
by r2=7T0 m j n 0 ma j/4, fi-2.33 is the mean molecular weight, 
mu is the mass of the atomic hydrogen, and k v is the dust 
opacity. We have adopted the dust opacity from Beckwith 
et al. (1990), /f v =0.1(v/lTHz)4cm 2 g _1 , which is appropri¬ 
ate for the case of dense clouds at intermediate densities. 
The column density is computed at 857 GHz, close to the 
1 THz reference of Beckwith et al. (1990), which allows us 
to minimize the impact of assuming a fixed emissivity spec¬ 
tral index, /3=2. Changing the emissivity spectral index in 
the range 1-3 yields up to 15% variations on the emissiv¬ 
ity estimate, which is negligible compared to the intrinsic 
uncertainty of the emissivity. This is also the Planck- HFI 
band where the S/N is the highest, and where dust emission 
remains optically thin. Note that S (, 57 , integrated over 61, is 
half the flux density provided in the catalogue, .S’* 57 , which 
is the flux density integrated over the whole clump. 

The peak column density is defined by: 


jypeak _ 
H 2 - 


/y 

P m U By(T) Ky ’ 


(5) 


where I v is the surface brightness measured at the peak, 
and it can be derived from Ah, by multiplying by 1.38. We 
stress that these two estimates of the column density are de¬ 
rived for the Planck clumps only after removal of the warm 
component, which is not what is typically found in the lit¬ 
erature. 

The Ah, distribution is shown in Fig. 11. This 
ranges from 6 . 8 xl 0 18 to 1 . 2 xl 0 23 cm 2 , with a median of 
3.4 x10 20 chT 2 and 6.3xl0 20 cm ~ 2 for sources with FQ=1 
and 2 , respectively. 

Sources with FQ=2 have slightly larger column den¬ 
sities, which is expected if these sources effectively cor¬ 
respond to colder and denser objects. About 80% of the 
sources have a column density between 1.4xl0 20 and 
3.7xl0 21 cm -2 . We emphasize that these column densities 


are averaged over the size of the clump which, at the Planck 
resolution, is likely inducing a bias at low values, be¬ 
cause of beam dilution effects. Indeed, as shown in Planck 
Collaboration XXII (2011), the Herschel higher angular 
resolution observations of a sample of PGCC sources have 
revealed complex sub-structures, characterized by lower 
temperatures and higher densities compared to the parent 
clump. 


6.4. Mass and mean density 


For sources with a reliable distance estimate, the mass of 
the clump is given by: 


M = 


SyD 2 

Ky B V (T) ’ 


( 6 ) 


where S v is the flux density measured at 857 GHz, inte¬ 
grated over the solid angle 61 defined in Sect. 6.3, D is 
the distance, k v is the dust opacity defined in Sect. 6.3, and 
B V (T) is the Planck function for a dust temperature T. 

The mass distribution shown in Fig. 11, ranges from a 
few 10 2 to almost 10 5 M e , probing a large variety of ob¬ 
jects, from cores to giant molecular clouds. We stress that 
this mass distribution, as the linear size distribution dis¬ 
cussed in Sect. 6.2, is biased by our distance sample, which 
we know being highly heterogeneous (see Sect. 5). This be¬ 
comes clear when we partition the mass distribution accord¬ 
ing to distance, in particular by separating sources located 
closer ( dashed line) or further ( dotted line) than 1 kpc from 
the Sun. Furthermore, except for less than 40% of the cases 
where the distance estimates are obtained from molecu¬ 
lar complex association providing highly reliable estimates, 
the uncertainty on the computed mass is mainly dominated 
by the uncertainty on the distance, which is about two to 
three times larger than the uncertainties on the temperature 
and flux density involved in the calculation. Hence about 
84% of sources have mass estimates with a S/N between 1 
and 2, and 16 % with a S/N above 2. Note also that we have 
almost reached the theoretical sensitivity limit of Planck to 
low-mass cold cores, as it has been derived in Sect. 4.2, 
which is about 0.03 M 0 for a cold source located at 100 pc 
and having a column density of 10 2() cm -2 . 

Finally, we have obtained the mean density of the 
clump, computed as n^-M/V, where V is the volume of 
the clump, modeled as a sphere of diameter equal to the 
physical size of the clump derived in Sect. 6.2. Figure 11 
shows that the mean density ranges from 5.3 to 3.5 x 
10 4 cm~ 3 , again spanning a wide range of object categories. 
While the smallest estimates are not consistent with typical 
density of large clouds (i.e., ~ 10 2 cm" 3 , Williams et al., 
2000; Blitz, 1993), a closer look at the correlation with the 
size of the objects reveals that these values can be explained 
by a dilution effect in the Planck beam, as discussed in 
Sect. 6 . 6 . 


6.5. Luminosity 

We have computed the bolometric luminosity of the cold 
clumps as: 

L = 4kD 2 J's'ydv, (7) 

where D is the distance and S' v is the flux density of the 
clump modeled with a modified black body function with 
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Table 6. Statistical description of the physical properties of the PGCC clumps with FQ=1. The statistics of the quantities 
denoted with has been computed on the sub-sample of sources having a distance estimate with DIST_QUALITY= 1 or 2. 


Percentile T c FWHM size* s 1V H , D* M* n H ,* L* 

_[K] □ [pc]_[cm- 2 ] [kpc] [Me] [cm- 3 ] [L 0 ] 

min. 8.6 3.4 0.14 0.10 6.8xl0 18 0.07 5.1xl0- 2 5.3x10° 7.6xl0- 2 

first 1% _ 10.5 4.6 0.20 0.36 3.2xl0 19 0.12 1.4x10-' 1.6x10' 1.4x10-' 

first 10% ... 12.1 5.8 0.27 0.61 9.8xl0 19 0.14 4.5x10-' 6.8x10' 3.6x10-' 

median. 14.5 7.5 0.71 0.83 3.4xl0 20 0.33 3.3x10° 3.0xl0 2 2.1x10° 

last 10%_ 17.0 9.3 1.91 0.94 1.8xl0 21 0.92 5.9x10' 2.4xl0 3 6.9x10' 

last 1%. 19.9 11.0 11.72 0.97 8.6xl0 21 5.51 6.8xl0 3 1.3xl0 4 7.5xl0 3 

max. 30.0 13.1 25.16 0.98 8.8xl0 22 10.48 2.6xl0 4 3.5xl0 4 7.6xl0 4 


Table 7. Same as Table 6 with FQ=2. The statistics of the quantities denoted with ★ has been computed on the sub-sample 
of sources having a distance estimate with DIST_QUALITY= 1 or 2. We recall that for this class of sources, we only provide 
an upper limit of the temperature. 


Percentile 

T c 

FWHM size* 

£ 

N H2 

D* 

M* 

Hh 2 * 

L* 


[K] 

['] [pc] 


[cm- 2 ] 

[kpc] 

[Me] 

[cnT 3 ] 

[L 0 ] 


min. 5.8 2.8 0.15 0.16 

first 1% _ 8.6 4.2 0.19 0.37 

first 10% ... 11.0 5.6 0.28 0.61 

median. 13.0 7.5 0.95 0.84 

last 10%_ 15.7 9.7 8.55 0.95 

last 1%. 22.2 11.7 15.69 0.98 

max. 30.0 14.5 30.56 0.98 


1.4xl0 19 

0.07 
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Fig. 12. Distribution of the luminosity to mass ratio of the 
PGCC clumps for sources with FQ= 1 (blue line) and FQ=2 
(green line). The dot-dot-dot-dashed line denotes the distri¬ 
bution for the sub-sample of sources with a reliable distance 
estimate. 

temperature T and emissivity spectral index /I, as described 
in Sect. 6.1, and normalized to S v defined in Sect. 6.3. The 
integral is taken over the frequency range 300 GHz < v < 
lOTHz, which extends beyond the IRAS and Planck spec¬ 
tral coverage. Any emission at shorter wavelengths is not 
included in this calculation. From Fig. 11 we note that the 
luminosity distributions for sources in the FQ= 1 and 2 cate¬ 
gories are very heterogeneous, ranging from 10- 2 to 10 6 L 0 . 

Figure 12 illustrates the L/M distribution. This quantity 
presents the advantage of not depending on distance, thus 
it can be derived for all the sources in the catalogue. It is 
often used as an indicator of the evolutionary stage of cold 
sources. About 73% and 83% of the sample with FQ= 1 and 


2, respectively, have a L/M < 1, which is typical of ob¬ 
jects with low degree of evolution. Interestingly, we find 
the same proportion when we analyse the L/M distribution 
(i dotted lines ) for objects with a reliable distance estimate, 
suggesting that the sample of sources with distance esti¬ 
mates is statistically representative of the entire sample. 

6.6. Correlation between physical properties 

A summary of the statistical properties of the PGCC clod 
clumps physical quantities is provided in Table 6, for 
sources with FQ=1, and in Table 7, for sources with FQ=2. 
For each quantity the 1%, 10% lower and higher percentiles 
and the minimum, median and maximum values are re¬ 
ported. We stress that there is no systematic correlation be¬ 
tween physical properties for the objects in the lowest and 
highest percentile bins. Indeed, the existence of a potential 
correlation between the physical properties of the clumps is 
explored in Fig. 13. Only sources with robust distance es¬ 
timates (DIST_QUALITY=1 or 2) and reliable flux densities 
(FLUX_QUALITY=1 or 2) have been considered, amounting 
to a total of 3867 clumps. The locus of the Simon et al. 
(2006b) IRDCs properties is shown as red points, including 
local thermodynamic equilibrium (LTE) mass estimates. 

As expected, the clumps size and mass are strongly cor¬ 
related with the distance estimate, while the FWHM of the 
PGCC sources is mainly constrained by the 5' Planck beam, 
ranging from 5.'6 to 9.'7 for 80% of the sources. Artefacts 
due to the distance association with molecular clouds are 
visible as straight lines in all the scatter plots including 
the distance. Comparing the PGCC distributions (black) 
to the IRDC population (red), it appears that the PGCC 
Galactic cold clumps correspond statistically to smaller ob¬ 
jects located in the solar neighbourhood and detected at all 
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Fig. 13. Correlation plots for: temperature, column density, distance, size and mass. The black dots represent 3867 PGCC 
sources with a robust distance and flux density estimate. The red dots denote the IRDCs from Simon et al. (2006b).between 
the following physical properties of the 3867 PGCC sources for which a robust distance estimate is available combined 
with a reliable flux density: temperature, column density, distance, size and mass. It is compared to the physical properties 
of IRDCs from Simon et al. (2006b) in red. 


latitudes, while the IRDCs are preferentially detected to¬ 
wards the Galactic plane and represent distant structures. 
Furthermore, IRDCs appear more massive and dense than 
the PGCC sources. 

It is also interesting to notice that the population of 
PGCC at very low column density (< 10 20 cm 2 ) corre¬ 
sponds to sources with an intermediate linear size, from 0.5 
to 2 pc. This is fully consistent with the dilution of a dense 
core of 0.1 pc with a column density of 10 22 cm 2 at a dis¬ 
tance of 500pc in the Planck beam (5')- Similarly, we may 
explain the very low density estimates (a few cm -3 ), which 
are not expected even for large clouds. 

7. Ancillary Validation 

7.1. Identification with Planck internal catalogues 

We have performed a positional cross-correlation of the 
PGCC catalogue with the other Planck internal cata¬ 
logues, using a 5' radius. In particular, we have con¬ 
sidered: the Early Cold Cores catalogue (ECC), the 
Planck Catalogue of Compact Sources (PCCS), the Planck 
Catalogue of Sunyaev-Zeldovich sources (PSZ), and the 
Planck Catalogue of High-redshift source candidates (PH-;). 
The results are summarized in Table 8. 


7.1.1. ECC 

The Early Cold Cores catalogue contains 915 sources that 
were detected with high S/N by the Planck Early Mission 
using the same colour-detection method used for the PGCC 
catalogue. We find matches in positions for 892 ECC 
sources. The missing 23 ECC sources have a strong signal 
in all three detection maps. However, we have deblended 
the detection maps using the local maxima of the detec¬ 
tion signal (see Montier et al., 2010, Sect. 5.2), and adopted 
the positions of these local maxima for the coordinates of 
the detections at each frequency. The individual frequency 
catalogues have then been merged to create the final cata¬ 
logue. This last step of band merging has been improved in 
the final version of the catalogue to ensure a higher degree 
of compactness of the detected clump across the frequen¬ 
cies. The missing 23 ECC sources do no longer satisfy the 
compactness criterion. This does not mean that these ECC 
sources are spurious, rather it suggests that they are slightly 
more extended compared to the rest of the sources in the 
catalogue, hence they are discarded. 

7.1.2. PCCS 

The Planck Catalogue of Compact Sources (PCCS, Planck 
Collaboration XXVI, 2015) contains frequency catalogues 
in each Planck band.The PCCS detections are obtained 
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Table 8 . Result of the positional cross-correlation of the PGCC catalogue with the other Planck internal catalogues. 


PGCC FLUX_QUALITY 

Catalogue option N 1 2 3 Total 

(6993) (3755) (2440) (13188) 

ECC..915 622_237_55_892 

PCCS 857 . zoneO 4891 55 9 5 69 

zone 1-3 43290 5361 2511 1725 9597 

PCCS 545 . zone 0 1694 67 15 12 94 

zone 1-3 31068 5010 2412 1596 9018 

PCCS 353 . zoneO 1344 59 17 8 84 

zone 1-3 22665 4645 2227 1393 8265 

PCCS 217 . zoneO 2135 168 15 27 210 

zone 1-3 16842 3963 1836 1129 6928 

PCCS 143 . zone 0 2160 106 12 9 127 

zone 1-3 4139 959_748_320 2027 

PCCS 100 . zoneO 1742 119 26 20 165 

zone 1-3 2487 545_478_225_1248 

PCCS 70 . zoneO 1101 17 23 14 54 

zone 1-3_195_6_8_6_20 

PCCS 44 . zoneO 830 6 17 13 36 

zone 1-3 104 - 5 1 6 

PCCS 30 . zoneO 1435 16 19 18 53 

zone 1-3_125_2_5_6_13 

PCCS 857x545x357 . zone 0 648 27 8 2 37 

_zone 1-3 20534 3946 1890 1165 7001 

PSZ..1653_31_16_18_65_ 

PHz . 1261 2 13 - 15 


from individual frequency maps to which a Mexican Hat 
Wavelet filter has been applied. This procedure allows re¬ 
moval of extended emission and noise. Because of this data 
treatment, the S/N of PCCS sources depends on the fre¬ 
quency and on the area of the sky. The detections in each 
band are divided into two categories depending on the re¬ 
gion of the sky: zone 0 corresponds to regions where the 
reliability of the sources has been quantified, while zones 1 
to 3 correspond to filaments. Galactic regions or filaments 
in Galactic regions, respectively. The second category of 
PCCS sources is considered to have a lower reliability than 
the zone 0 sources. 

The number of retrieved matches is given in Table 8. In 
the Table we provide the total number of matches (last col¬ 
umn), as well as by FLUX_QUALITY flag (second, third and 
fourth columns). This is given for each sub-category of the 
PCCS catalogue: zone 0 and zones 1-3. Since the PCCS 
catalogue contains warm sources that are rejected from the 
PGCC catalogue, it is not surprising that we do not find 
PGCC matches for all the PCCS detections. However, an 
important fraction of the PGCC sources have a counterpart 
in the PCCS catalogue, especially in the highest frequency 
channels, up to 75% in the 857 GHz band when including 
the zones 1-3 sources. Notice also that the proportion of 
matched sources in the low frequency channels becomes 
extremely small (131 in total) and mainly consists of zone-0 
sources, which could represent a population of extragalac- 
tic radio contaminants that has not been fully rejected. More 
interestingly, the PGCC detection algorithm requires a de¬ 
tection in all three frequency cold residual maps, while the 
PCCS only requires single frequency detections. There are 


21 182 PCCS sources that are simultaneously detected at 
857, 545, and 353 GHz. Of these, 7038 have a match in the 
PGCC catalogue in a 5' radius. Thus the PGCC catalogue 
contains ~ 45% of new sources not already identified in the 
PCCS catalogue in the three upper Planck frequencies as 
cold sources. 

7.1.3. PSZ 

The Planck Catalogue of Sunyaev-Zeldovich sources (PSZ, 
Planck Collaboration XXVII, 2015) contains 1653 detec¬ 
tions. These are exclusively extragalactic sources, so the 
overlap between the PSZ and PGCC catalogues is small, 
i.e., 65 sources. Most of these sources exhibit cold tempera¬ 
ture, except one which is warm. Since the CoCoCoDeT algo¬ 
rithm is suited to detect sources whose SED peaks around 
857 GHz, it is not expected to be sensitive to the SZ effect 
from galaxy clusters, which is characterized by a peak at 
353 GHz and almost no emission contribution at 857 GHz. 
Hence it is likely that this sample of 65 matched sources, ly¬ 
ing at the limit of the Galactic mask used when building the 
PSZ catalogue, consists of Galactic cold clumps at interme¬ 
diate to high latitudes, therefore representing a contaminant 
for the PSZ catalogue. 

7.1.4. PH z 

The Planck Catalogue of high-;; source candidates (PH-;, 
Planck Collaboration, 2015 in preparation) is a catalogue 
of sources detected at high latitude, in the 26% clean¬ 
est fraction of the sky, and consists of a sample of high- 
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redshift candidates identified by their ’red’ colours in the 
Planck highest frequency bands. By cross-correlating the 
PGCC and PHz catalogues, we found 15 common sources. 
This result was partly expected, due to the similar spec¬ 
tral behaviour of Galactic cold objects and extragalactic red 
sources. Nevertheless, at the time of writing the nature of 
these matches is uncertain and requires further investiga¬ 
tion. 

7.2. Crosscheck with follow-up observations 
7.2.1. Herschel imaging 

A sub-sample of the PGCC sources has been further investi¬ 
gated in the Herschel Open Time Key Programme Galactic 
Cold Cores (hereafter, HKP-GCC). Herschel PACS and 
SPIRE instruments were used to observe 115 fields at five 
wavelengths, from 100/mi to 500 pm, with an angular res¬ 
olution from 12" to 37". The fields were selected based 
on an early version of the PGCC catalogue, and target 349 
individual Planck clumps, spanning a wide range in S/N, 
latitude and temperature. 

The sensitivity and resolution of the Herschel instru¬ 
ments (Pilbratt et al., 2010; Poglitsch et al., 2010; Griffin 
et al., 2010) enable detailed studies of the structure of the 
clumps and their interplay with their parent clouds. First 
results of the Herschel follow-up have been presented in 
Juvela et al. (2010,2011,2012) and in Planck Collaboration 
XXII (2011). 

For the purpose of target selection, the Planck cold 
clumps have been binned according to their Galactic coor¬ 
dinates, their estimated dust colour temperature, and their 
mass. The bins are identified by the following boundary 
values: l- 0, 60, 120, and 180 degrees; \h\ =1, 5, 10, and 
90 degrees; T& ust =6, 9, 11, and 14 K; M =0, 0.01, 2.0, 
500,10 6 Mq. Here 7 dusl is the clump cold dust temperature. 
After creating the bins, the targets have been selected using 
a Monte Carlo technique, in which we have uniformly sam¬ 
pled the sources in the various bins. This procedure allows 
us to cover entirely the parameter space, including sources 
at high latitudes and with extreme mass values. The M- 0 
bin has been reserved for sources with no distance estimate. 

Noteworthy, the selection of the PGCC sources had 
to avoid areas of the sky covered by other Herschel Key 

o 

Programmes, such as the Galactic plane for \b\ < 1 , which 
was targeted by the Hi-GAL programme (Molinari et al., 
2010), and several nearby clouds that are included in the 
Gould Belt (Andre et al., 2010) and HOBYS (e.g., Motte 
et al., 2010) programmes. 

The final selection of 115 fields includes 16 fields at 
Galactic latitudes above 20 degrees. The median peak col¬ 
umn density in these fields is Ah, = 1.5 X 10 21 cm~ 2 (for 
details on the column density calculation see Juvela et al., 
2012). In each followed-up field, the Planck clump coordi¬ 
nates clearly identify a coherent structure in the Herschel 
surface brightness maps. This is the case even when the 
surface brightness is below 5 MJy sr^ 1 at 250/fm, suggest¬ 
ing that the CoCoCoDeT algorithm is able to reliably extract 
very low column density features from the Planck data. 

The Herschel data have been used to generate column 
density and colour temperature maps (Montillaud et al., 
2014 submitted). Figure 14 shows the pixel-to-pixel two- 
dimensional distribution of column density vs temperature. 
Pixels located within lcr of the elliptical Gaussian fit of the 
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Fig. 14. Temperature vs. column density two-dimensional 
distribution for all PGCC cold clumps followed-up by 
Herschel in the HKP-GCC programme. The plotted data 
points are identified as IN and OUT: IN points are located 
inside the Planck clump solid angle, while the OUT points 
are located outside. The distribution of the OUT points is 
shown in colour, while the distribution of the IN points is 
denoted by cyan contours at three different levels : 10, 50 
and 90% of the maximum. 

PGCC sources are defined as IN pixels, and are shown with 
cyan contours in the Figure. They exhibit a much narrower 
distribution with respect to pixels located outside the PGCC 
sources, defined as OUT pixels. The IN pixels are mostly 
found in correspondance of the coldest and densest regions 
of the two-dimensional diagram, while the OUT pixels are 
also in regions of low column density and relatively high 
temperature. This result highlights the dense and cold na¬ 
ture of the PGCC sources. 

Based on the Herschel follow-up, Montillaud et al. 
(2014 submitted) have also built a catalogue of com¬ 
pact sources. After removing extragalactic contaminants, 
pre-stellar candidates have been identified with a multi¬ 
wavelength analysis. Montillaud et al. (2014 submitted) 
have performed a detailed analysis aimed at classifying 
the Herschel sources according to their evolutionary stage, 
i.e., from starless cores to Young Stellar Objects (YSO). 
For each Herschel field, the fraction of YSOs and starless 
sources falling inside the 3-cr elliptical Gaussian contour of 
the PGCC clumps are derived, providing a unique informa¬ 
tion of the evolutionary stage of the PGCC clumps observed 
as a whole with Planck. A preliminary analysis of the YSO 
and starless populations in the Herschel fields around the 
349 PGCC sources has already shown that Planck clumps 
contain a mixture of the two populations in various pro¬ 
portions: we find a number of Planck clumps with only 
one source that is a starless core (/ s tariess= 1 ), or a YSO 
(/yso=1), but more often sources composed by a mixture of 
YSOs and starless cores with fractions ranging from 10% 
to 50%. Again this illustrates the variety of sources cov¬ 
ered by the PGCC catalogue, probing the ISM in extremely 
various evolutionary stages. 
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7.2.2. Gas tracers 

In order to characterize the gas content of the Planck cold 
clumps, several ground-based follow-ups of dense medium 
tracers have been performed. 

Wu et al. (2012) carried out a large survey of the 12 CO, 
13 CO and C ls O J-l— >0 transition, targeting 674 ECC 
clumps with the 13.7 m telescope of the Purple Mountain 
Observatory. All the clumps (except for one) have 12 CO 
and 13 CO detections, and 68% of them have C ls O emis¬ 
sion. The three line peak velocities are found to coincide, 
suggesting that the Planck clumps are quite cold and quies¬ 
cent. The comparison between the derived excitation tem¬ 
perature and the dust temperature shows that dust and gas 
are well coupled in 95% of the clumps. Ten of the ECC 
clumps were also mapped, evidencing sub-structures, such 
as cores, and filamentary/elongated morphologies. 

8. Conclusion 

The highest frequency bands of the Planck- HFI instru¬ 
ment provide an extremely powerful tracer of Galactic 
cold dust. By combining data from the Planck three up¬ 
per bands with IRAS 3 THz data, we have conducted a 
multi-frequency compact source detection, and generated 
the Planck Catalogue of Galactic Cold Clumps (PGCC). A 
first version of this catalogue was released in 2011, i.e., the 
Early Cold Core catalogue (ECC), together with the Early 
Release Compact Source Catalogue (ERCSC). At that time, 
915 sources, selected for their low temperature and high 
S/N, were made publicly available to the astronomical com¬ 
munity. With the present work, we are releasing the whole 
PGCC catalogue, containing 13188 Galactic sources and 54 
cold sources located in the LMC and SMC. 

We have applied the CoCoCoDeT algorithm (Montier 
et al., 2010) to the Planck 857, 545, and 353 GHz maps and 
to the IRAS 3 THz data. The combined use of these maps 
allow the separation of the cold and warm emission compo¬ 
nents, hence the identification of sources colder than their 
local environment. Through a dedicated analysis, we have 
removed all possible extragalactic contaminants. In partic¬ 
ular, we have cross-correlated the PGCC catalogue with 
publicly available extragalactic catalogues, leading to the 
rejection of less than one hundred sources, mostly at high 
Galactic latitude. In parallel, we have also discarded de¬ 
tections, which have turned out to be contaminated by the 
presence of nearby hot sources. Noteworthy, the final cat¬ 
alogue contains 54 sources located in the LMC and SMC. 
These sources have been kept in the catalogue because the 
proximity of these dwarf galaxies makes it possible to de¬ 
tect individual clouds with Planck. 

The PGCC sources have been divided into three cate¬ 
gories, depending on the quality of their flux density es¬ 
timates. 6993 sources have accurate photometry in both 
the Planck and IRAS bands. These sources correspond to 
FQ= 1 and are the most reliable in the catalogue. An amount 
of 3755 sources have accurate flux densities in the Planck 
bands but not in the IRAS 3 THz band. These sources cor¬ 
respond to FQ=2, and are likely so cold that their emission 
at 3 THz falls below the IRAS detection limit. The last cat¬ 
egory comprises 2440 sources, and for these no accurate 
flux density has been measured in at least two bands. These 
sources correspond to FQ=3 and might be intrinsically very 
faint or still deeply embedded. Despite the poor photome¬ 
try, they are considered real detections. 


We have combined seven independent methods to as¬ 
sign a distance estimate to 5574 sources. In the PGCC cat¬ 
alogue, for each source we quote all the available distances 
derived from the different methods, however, only the most 
reliable estimate is used to compute other source physi¬ 
cal quantities such as mass and luminosity. Distance esti¬ 
mates from different methods have been compared and val¬ 
idated. Accordingly, we have assigned, to each clump, a 
DIST_QUALITY flag: 464 sources have consistent distance 
estimates (DIST_QUALITY=1); 4191 sources have only one 
estimate (DIST_QUALITY=2); 255 have incompatible es¬ 
timates (DIST_QUALITY=3); 664 sources have only dis¬ 
tance upper limits (DIST_QUALITY=4). More detailed on 
this analysis can be found in the on-line version of the 
PGCC catalogue. The 4655 sources with an accurate dis¬ 
tance estimate (DIST_QUALITY=1 or 2) are mainly located 
in the solar neighbourhood, with about 85% of sources at 
less than 1 kpc, and 91.4% within 3 kpc from the Sun. 

We have obtained the temperature for 10748 sources, 
using a free (for sources with FQ=1) or fixed (j3=2, for 
sources with FQ=2) emissivity spectral index. The cat¬ 
alogue temperature distribution confirms that the PGCC 
clumps are not only colder than their local environment 
(as by construction of the catalogue) but, more importantly, 
that they are intrinsically cold sources, with a median be¬ 
tween 13 and 14.5 K, depending on the quality of the flux 
density measurements. The minimum temperature of the 
sources in the catalogue is 5.8 K, reached for sources with 
FQ=2. It is important to emphasize that this value is not 
a threshold artificially induced by our detection method, 
which in fact has been shown (through a MCQA analysis) 
to provide a 60% completeness level at temperatures as low 
as 6 K. Therefore, we can confidently state that, at least at 
the Planck angular resolution, no Galactic source is colder 
than 5.8 K. 

From the flux densities, temperature and distance es¬ 
timates, we have derived other physical properties of the 
PGCC clumps, namely: column density, physical size, 
mass, mean density and luminosity. The column density of 
the Planck clumps covers almost five orders of magnitude, 
reaching a value as low as 6.8xl0 18 cm~ 2 , which can be 
compared to the sensitivity limit (3xl0 21 cm~ 2 ) of the MSX 
absorption measurements used to detect IRDCs (Peretto & 
Fuller, 2010). Hence objects detected in emission by Planck 
with the CoCoCoDeT algorithm may not be detected in ab¬ 
sorption by MSX, meaning that the Planck PGCC sources 
might represent a larger class of objects than the IRDCs, 
and might include less dense and/or more deeply embed¬ 
ded objects. Furthermore, the PGCC sources are character¬ 
ized by a wide range of sizes and mean densities, which is 
indicative of a variety of astrophysical sources and evolu¬ 
tionary stages. The physical size of the catalogue sources 
ranges from 0.14pc to 30.6pc, i.e., from the typical size of 
a cold core to the one of a giant molecular cloud. Similarly 
the mean density spans four orders of magnitude, from 5.3 
to 3.5x10 4 cm -3 , encompassing the three categories intro¬ 
duced by Williams et al. (2000), that are cores, clumps and 
clouds. 

We emphasize that, although we have adopted the term 
clump to refer to the generic source in the PGCC cata¬ 
logue, we are aware that, depending on the distance, some 
of these sources are in fact cores, either pre- or proto-stellar, 
while others are giant molecular clouds. The preliminary 
Herschel and gas tracers follow-ups have confirmed that 
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the PGCC sources are indeed cold and dense environments, 
but have also shown that they often contain colder sub¬ 
structures (e.g., cores) and even warm components (e.g., 
YSOs). In the future, other follow-ups of this kind, as well 
as cross-correlations with already existing ancillary data 
sets (e.g., Herschel, WISE or AKARI), will become neces¬ 
sary to shed light on the exact nature of the Planck clumps. 

We believe that the PGCC catalogue, covering the 
whole sky, hence probing wildly different environments, 
represents a real goldmine for investigations of the early 
phases of star formation. These include, but are not limited 
to: i) studies of the evolution from molecular clouds to cores 
and the influence of the local conditions; ii) analysis of the 
extreme cold sources, such as the most massive clumps or 
those located at relatively high latitude; iii) characteriza¬ 
tion of the dust emission law in dense regions and the role 
of the environment. All these topics will be discussed in 
forthcoming publications. 
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Appendix A: LMC - SMC 

As noted in Sect. 3.3, the PGCC catalogue includes 
51 sources located within a 4?09 radius of the Large 
Magellanic Cloud (LMC), and three sources within a 2?38 
field centred on the Small Magellanic Cloud (SMC). Of the 
51 sources in the LMC, 42 sources have FQ= 1 or 2, while 
9 are only considered as poor detections. In the SMC, two 
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Fig.A.l. Distribution of the PGCC sources located in the 
LMC (in Galactic coordinates) for each FLUX_QUALITY cat¬ 
egory: "Reliable flux densities" (1, blue), "Missing 3 THz 
flux density" (2, green) and "Detection only" (3, pink). The 
grey scale image is the Planck intensity map at 857 GHz 
shown in log scale between 10 2 and 0.5M.lysr . 


sources have a good FLUX_QUALITY flag and one is only 
detected. 

At the distance of the Magellanic Clouds (Dlmc = 
50.1 kpc, Dsmc = 61.7 kpc. Walker, 2012; Hilditch et al„ 
2005), the working resolution of the Planck and IRAS maps 
(5') corresponds to spatial scales of ~ 80pc, which is com¬ 
parable to the characteristic size of giant molecular clouds 
(GMCs) in the Milky Way (~ 50pc, e.g. Blitz, 1993). 
Magellanic PGCC sources are therefore quite different ob¬ 
jects to Galactic PGCC sources, but they are still of consid¬ 
erable interest for studying the early phases of star forma¬ 
tion. Firstly, the Planck data provide a census of cold mate¬ 
rial in the vicinity of the Magellanic Clouds that is indepen¬ 
dent of previous observations of HI and CO emission, the 
standard tracers of neutral interstellar gas in external galax¬ 
ies. Secondly, PGCC sources in the Magellanic Clouds con¬ 
stitute a useful comparison sample to local GMCs with 
low levels of star formation, since the galactic environ¬ 
ment hosting the cold molecular material is quite distinct. 
The clouds span a small but appreciable range of sub-solar 
metallicities (0.2 to 0.5, Russell & Dopita, 1992), and have 
dust-to-gas mass ratios that are ~ 3 to 10 times lower than 
the value in the solar neighborhood (Gordon et al., 2014). 
They are also low-mass systems, with shallower gravita¬ 
tional potentials and lower levels of shear than the normal 
disc galaxies. 

Of the 51 PGCC sources detected towards the LMC, 
34 are located within the field surveyed by NANTEN at 
2.'6 resolution for CO emission in the LMC (Fukui et ah, 
2008), while 27 sources are located inside the region ob¬ 
served by the higher resolution (45") MAGMA LMC sur¬ 
vey (Wong et ah, 2011). All of these 27 sources exhibit CO 
emission that is well-detected by MAGMA, with peak in¬ 
tegrated 12 CO J— 1 —>0 intensities brighter than MAGMA’s 
4cr sensitivity limit (~ 1.2 K.km.s -1 ). Dedicated follow-up 
observations of the remaining Magellanic PGCC sources 
with the Mopra Telescope have detected CO emission as¬ 
sociated with a further 15 PGCC sources in the LMC, but 
none of the three SMC sources (Hughes, 2015 in prepara¬ 
tion). Figure A. 1 shows the spatial distribution of all PGCC 
sources in the Magellanic Clouds. The spatial distribution 
of PGCC sources in the LMC is clearly not random: only 
one PGCC source is detected at high stellar surface den¬ 
sities (E* > 100 Mpc 2 ) even though many CO clouds are 
detected there, while there are four PGCC sources that ap¬ 
pear to be aligned in an east-west direction along the south¬ 
ern periphery of the LMC. These sources were previously 
noted in the dust property maps obtained by combining the 
IRAS and the Planck data as regions of low temperature 
and high dust column density. 

In the LMC, the CoCoCoDeT algorithm therefore seems 
to be an efficient tool for identifying cold molecular mate¬ 
rial. After re-scaling for the lower dust-to-gas ratio in the 
LMC and accounting for mismatches between the intrin¬ 
sic source size and the Planck and Mopra beam widths, 
the catalogued masses of the LMC PGCC sources are in 
good agreement with the masses derived from the Mopra 
CO data. A more detailed investigation of the spatial distri¬ 
bution and physical nature of the LMC PGCC sources will 
be presented in Hughes (2015 in preparation). 
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Fig. B.l. All-sky distribution of the PGCC sources with FQ=1 or 2, displayed over the Planck I2 CO /=1—>0 map ranging 
from -5 to 30 K.kms -1 . The locations of the molecular complexes used in the distance estimate procedure are shown as 
red circles. 


Appendix B: Correlation with CO map 

In Fig. B.l we overlay the all-sky distribution of the PGCC 
Galactic clumps to the Planck 12 CO J-l —>0 all-sky map 
(see Planck Collaboration XIII, 2014). As expected, for 
a large majority of the PGCC sources, their location co¬ 
incides with a I2 CO /=1 —>0 transition. This is especially 
true in the Galactic disc. At high latitude, objects appear 
less associated with a CO signature, although this is likely 
due to the limited sensitivity of the Planck CO map. A fur¬ 
ther analysis is required to investigate whether the PGCC 
sources are associated to CO clumps, or only with diffuse 
CO emission. 


Appendix C: Catalogue content 
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Table C.l. PGCC catalogue columns. 


Column Name 


Unit 


Description 


NAME . . 
SNR 

SNR.857 

SNR.545 

SNR.353 


GLON 
GLAT 
RA . 
DEC 


GAU_MAJOR_AXIS . 

GAU_MAJOR_AXIS_SIG . . . 

GAU_MINOR_AXIS . 

GAU_MINQR_AXIS_SIG . . . 
GAU_POSITION_ANGLE . .. 

GAU_POSITION.ANGLE.SIG 


FLUX.3000.CLUMP . 

FLUX.3000_CLUMP.SIG . . . 

FLUX.857.CLUMP . 

FLUX.857_CLUMP.SIG 

FLUX.545.CLUMP . 

FLUX.545_CLUMP.SIG 

FLUX.353.CLUMP . 

FLUX.353_CLUMP.SIG 

FLUX.3000.WBKG . 

FLUX.3000_WBKG.SIG 

FLUX.857.WBKG . 

FLUX.857_WBKG.SIG . 

FLUX.545.WBKG . 

FLUX.545_WBKG.SIG . 

FLUX.353.WBKG . 

FLUX.353_WBKG.SIG . 

FLUX.QUALITY . 

FLUX.BLENDING . 

FLUX.BLENDING.IDX . 

FLUX.BLENDING.ANG.D 1ST 

FLUX.BLENDING_BIAS.3000 

FLUX.BLENDING_BIAS.857 

FLUX.BLENDING_BIAS.545 

FLUX.BLENDING_BIAS.353 


DIST.KINEMATIC . 

DIST.KINEMATIC.SIG . 

DIST.0PT_EXT.DR7 . 

DIST.0PT_EXT_DR7.SIG . 

DIST.0PT_EXT.DR9 . 

DIST.0PT_EXT_DR9.SIG . 

DIST.NIR_EXT.IRDC . 

DIST_NIR_EXT_IRDC_SIG 

DIST.NIR.EXT . 

DIST.NIR_EXT.SIG . 

DIST.M0LECULAR.C0MPLEX . . . 
DIST.M0LECULAR.C0MPLEX.SIG 

DIST.HKP.GCC . 

DIST.HKP_GCC.SIG . 

DIST.0PTI0N . 

DIST.QUALITY . 

DIST . 

DIST.SIG . 


[deg] 

[deg] 

[deg] 

[deg] 


[arcmin] 

[arcmin] 

[arcmin] 

[arcmin] 

[rd] 

[rd] 


[Jy] 

[jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[Jy] 

[1-3] 

[ 0 , 1 ] 

[arcmin] 

[%] 

[%] 

[%] 

[%] 


[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[kpc] 

[0-7] 

[0-4] 

[kpc] 

[kpc] 


Identification 
Source Name 

Maximum S/N over the 857, 545, and 353 GHz Planck cold residual maps 
S/N of the cold residual detection at 857 GHz 
S/N of the cold residual detection at 545 GHz 
S/N of the cold residual detection at 353 GHz 

Source position 

Galactic longitude based on morphology fitting 

Galactic latitude (deg) based on morphology fitting 

Right ascension (J2000) in degrees transformed from (GLON, GLAT) 

Declination (J2000) in degrees transformed from (GLON, GLAT) 

Morphology 

FWHM along the major axis of the elliptical Gaussian 

lcr uncertainty on the FWHM along the major axis 

FWHM along the minor axis of the elliptical Gaussian 

lcr uncertainty on the FWHM along the minor axis 

Position angle of the elliptical gaussian, defined as the clockwise angle 

between the Galactic plane orientation and the orientation of the major axis 

lcr uncertainty on the position angle 

Photometry 

Flux density of the clump at 3 THz 

lcr uncertainty on the flux density of the clump at 3 THz 

Flux density of the clump at 857 GHz 

lcr uncertainty on the flux density of the clump at 857 GHz 

Flux density of the clump at 545 GHz 

lcr uncertainty on the flux density of the clump at 545 GHz 

Flux density of the clump at 353 GHz 

lcr uncertainty on the flux density of the clump at 353 GHz 

Flux density of the warm background at 3 THz 

lcr uncertainty on the flux density of the warm background at 3 THz 

Flux density of the warm background at 857 GHz 

lcr uncertainty on the flux density of the warm background at 857 GHz 

Flux density of the warm background at 545 GHz 

lcr uncertainty on the flux density of the warm background at 545 GHz 

Flux density of the warm background at 353 GHz 

lcr uncertainty on the flux density of the warm background at 353 GHz 

Category of flux density reliability 

1 if blending issue with flux density estimate 

Catalogue index of the closest source responsible for blending 

Angular distance to the closest source responsible for blending 

Relative bias of the flux density at 3 THz due to blending 

Relative bias of the flux density at 857 GHz due to blending 

Relative bias of the flux density at 545 GHz due to blending 

Relative bias of the flux density at 353 GHz due to blending 

Distance 

Distance estimate [1] using kinematics 

lcr distance estimate using kinematics 

Distance estimate [2] using optical extinction on SDSS DR7 

lcr distance estimate using optical extinction on SDSS DR7 

Distance estimate [3] using optical extinction on SDDS DR9 

lcr distance estimate using optical extinction on SDSS DR9 

Distance estimate [4] using near-infrared extinction towards IRDCs 

lcr distance estimate using near-infrared extinction towards IRDCs 

Distance estimate [5] using near-infrared extinction 

lcr distance estimate using near-infrared extinction 

Distance estimate [6] using molecular complex association 

lcr distance estimate using molecular complex association 

Distance estimate [7] from the Herschel Key-Programme Galactic Cold Cores 

lcr distance estimate from the Herschel HKP-GCC 

Option of the best distance estimate used in other physical properties 

Quality Flag of the consistency between distance estimates 

Best distance estimate used for further physical properties 

lcr uncertainty on the best distance estimate 
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Table C.2. PGCC catalogue columns (continued). 


Column Name 


Unit 


Description 


Temperature 

TEMP.CLUMP. [K] Temperature of the clump with /3 as a free parameter 

TEMP_CLUMP_SIG . [K] lcr uncertainty on the clump temperature with/? free 

TEMP_CLUMP_L0W1 . [K] Lower 68% confidence limit of the clump temperature with/? free 

TEMP_CLUMP_UP1 . [K] Upper 68% confidence limit of the clump temperature with /? free 

BETA.CLUMP. Spectral index f3 of the clump 

BETA_CLUMP_SIG . lcr uncertainty (from MCMC) on the emissivity spectral index /? of the clump 

BETA_CLUMP_L0W1 . Lower 68% confidence limit of the emissivity spectral index /? of the clump 

BETA_CLUMP_UP1 . Upper 68% confidence limit of the emissivity spectral index /? of the clump 

TEMP_BETA2_CLUMP. [K] Temperature of the clump with /? = 2 

TEMP_BETA2_CLUMP_SIG . [K] lcr uncertainty on the temperature of the clump with /? = 2 

TEMP_BETA2_CLUMP_L0W1 . [K] Lower 68% confidence limit of the clump temperature with /? = 2 

TEMP_BETA2_CLUMP_UP1 . [K] Upper 68% confidence limit of the clump temperature with /? = 2 

TEMP.WBKG . [K] Temperature of the warm background with /? as a free parameter 

TEMP_WBKG_SIG . [K] lcr uncertainty on the temperature of the warm background with /? as a free parameter 

TEMP_WBKG_L0W1 . [K] Lower 68% confidence limit of the warm background temperature with/? free 

TEMP_WBKG_UP1 . [K] Upper 68% confidence limit of the warm background temperature with /? free 

BETA.WBKG . Spectral index /? of the warm background 

BETA_WBKG_SIG . lcr uncertainty on the spectral index /? of the warm background 

BETA_WBKG_L0W1 . Lower 68% confidence limit of the emissivity spectral index /? of the warm background 

BETA_WBKG_UP1 . Upper 68% confidence limit of the emissivity spectral index /? of the warm background 

TEMP_BETA2_WBKG . [K] Temperature of the warm background with /? = 2 

TEMP_BETA2_WBKG_SIG . [K] lcr uncertainty on the temperature of the warm background with /? = 2 

TEMP_BETA2_WBKG_L0W1 . [K] Lower 68% confidence limit of the warm background temperature with /? = 2 

TEMP_BETA2_WBKG_UP1 . [K] Upper 68% confidence limit of the warm background temperature with /? = 2 


Physical properties 

NH2 . [cm -2 ] Column density N m of the clump 

NH2J3IG. [cm -2 ] lcr uncertainty on the column density of the clump 

NH2_L0W[ 1,2,3] . [cm -2 ] Lower 68%, 95% and 99% confidence limit of the column density 

NH2_UP[1,2,3] . [cm -2 ] Upper 68%, 95% and 99% confidence limit of the column density 

MASS . [Mq] Mass estimate of the clump 

MASS.SIG. [Mq] lcr uncertainty on the mass estimate of the clump 

MASS_L0W[1,2,3] . [Mq] Lower 68%, 95% and 99% confidence limit of the mass estimate 

MASS_UP[ 1,2,3] . [Mq] Upper 68%, 95% and 99% confidence limit of the mass estimate 

DENSITY . [cm -3 ] Mean density of the clump 

DENSITY_SIG . [cm -3 ] lcr uncertainty on the mean density estimate of the clump 

DENSITY_LOW[ 1,2,3] [cm -3 ] Lower 68%, 95% and 99% confidence limit of the mean density estimate 

DENSITY_UP[1,2,3] . [cm -3 ] Upper 68%, 95% and 99% confidence limit of the mean density estimate 

SIZE . [pc] Physical size of the clump 

SIZE.SIG. [pc] lcr uncertainty on the physical size estimate of the clump 

SIZE_L0W[1,2,3] [pc] Lower 68%, 95% and 99% confidence limit of the physical size estimate 

SIZE_UP[ 1,2,3] . [pc] Upper 68%, 95% and 99% confidence limit of the physical size estimate 

LUMINOSITY. [Lq] Luminosity of the clump 

LUMINOSITY.SIG . [Lq] lcr uncertainty on the luminosity of the clump 

LUMIN0SITY_L0W[ 1,2,3]. [Lq] Lower 68%, 95% and 99% confidence limit of the luminosity 

LUMINOSITY_UP[ 1,2,3] [Lq] Upper 68%, 95% and 99% confidence limit of the luminosity 


Flags 


NEARBY_HOT_SOURCE. [arcmin] Distance to the closest hot source 

XFLAG_LMC . [0,1] 1 if part of the LMC 

XFLAGJ3MC . [0,1] 1 if part of the SMC 

XFLAG_ECC . [0,1] 1 if present in the ECC 

XFLAG_PCCS_857 . [0,1] 1 if present in the PCCS 857 GHz band 

XFLAG_PCCS_545 . [0,1] 1 if present in the PCCS 545 GHz band 

XFLAG_PCCS_353 . [0,1] 1 if present in the PCCS 353 GHz band 

XFLAG_PCCS_217 . [0,1] 1 if present in the PCCS 217 GHz band 

XFLAG_PCCS_143 . [0,1] 1 if present in the PCCS 143 GHz band 

XFLAG_PCCS_100 . [0,1] 1 if present in the PCCS 100 GHz band 

XFLAG_PCCS_70 . [0,1] 1 if present in the PCCS 70 GHz band 

XFLAG_PCCS_44 . [0,1] 1 if present in the PCCS 44 GHz band 

XFLAG_PCCS_30 . [0,1] 1 if present in the PCCS 30 GHz band 

XFLAG_PSZ . [0,1] 1 if present in the PCCS SZ 

XFLAG_PHZ . [0,1] 1 if present in the PCCS HZ 

XFLAG_HKP_GCC . [0,1] 1 if present in the Herschel HKP-GCC 
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